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IMPROVED ADENOVIRUS VECTORS 



FIELD OF THE INVENTION 

The invention relates to improved adenovirus vectors, and more specifically, 
adenovirus vectors useful for gene therapy. 

BACKGROUND 

Adenoviruses (Ad) are double-stranded DNA viruses. The genome of adenoviruses 
(-36 kb) is complex and contains over 50 open reading frames (ORFs). These ORFs are 
overlapping and genes encoding one protein are often embedded within genes coding for other 
Ad proteins. Expression of Ad genes is divided into an early and a late phase. Early genes 
are those transcribed prior to replication of the genome while late genes are transcribed after 
replication. The early genes comprise El a. Elb. E2a. E2b. E3 and E4. The E la cene 
products are involved in transcriptional regulation: the Elb gene products are involved in the 
shut-off of host cell functions and mRNA transport. E2a encodes the a DNA-binding protein 
(DBP): E2b encodes the viral DNA polymerase and preterminal protein (pTP). The E3 gene 
products are not essential for viral growth in cell culture. The E4 region encodes regulatory 
protein involved in transcriptional and post-transcriptional regulation of viral gene expression: 
a subset of the E4 proteins are essential for viral growth. The products of the late genes (e.g., 
Ll-5) are predominantly components of the virion as well as proteins involved in the 
assembly of virions. The VA genes produce VA RNAs which block the host cell from 
shutting down viral protein synthesis. 

Adenoviruses or Ad vectors have been exploited for the delivery of foreign genes to 
cells for a number of reasons including the fact that Ad vectors have been shown to be highly 
effective for the transfer of genes into a wide variety of tissues in vivo and the fact that Ad 
infects both dividing and non-dividing cells: a number of tissues which are targets for gene 
therapy comprise largely non-dividing cells. 

The current generation of Ad vectors suffer from a number of limitations which 
preclude their widespread clinical use including: 1) immune detection and elimination of cells 
infected with Ad vectors. 2) a limited carrying capacity (about 8.5 kb) for the insertion of 
foreign genes and regulatory elements, and 3) low-level expression of Ad genes in cells 
infected with recombinant Ad vectors (generally, the expression of Ad proteins is toxic to 
cells). 
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The latter problem was thought to be solved by using vectors containing deletions in 
the El region of the Ad genome (El gene products are required for viral gene expression and 
replication). However, even with such vectors, low-level expression of Ad genes is observed. 
It is now thought that most mammalian cells contain El -like factors which can substitute for 
5 the missing Ad El proteins and permit expression of Ad genes remaining on the El deleted 
vectors. 

What is needed is an approach that overcomes the problem of low level expression of 
Ad genes. Such an approach needs to ensure that adenovirus vectors are safe and non- 
immunogenic. 

10 

SUMMARY OF THE INVENTION 

The present invention contemplates two approaches to improving adenovirus vectors. 
The first approach generally contemplates a recombinant plasmid. together with a helper 
adenovirus, in a packaging cell line. The helper adenovirus is rendered safe by utilization of 
15 loxP sequences. In the second approach, "damaged" adenoviruses are employed. While the 
"damaged" adenovirus is capable of self-propagation in a packaging cell line, it is not capable 
of expressing certain genes {e.g.* the DNA polymerase gene and/or the adenovirus preterminal 
protein gene). 

In one embodiment of the first approach, the present invention contemplates a 
20 recombinant plasmid. comprising in operable combination: a) a plasmid backbone. 

comprising an origin of replication, an antibiotic resistance gene and a eukaryotic promoter 
element: b) the left and right inverted terminal repeats fITRs) of adenovirus, said ITRs each 
having a 5' and a 3" end and arranged in a tail to tail orientation on said plasmid backbone: 
c) the adenovirus packaging sequence, said packaging sequence having a 5* and a 3' end and 
25 linked to one of said ITRs: and d) a first gene of interest operably linked to said promoter 
element. 

While it is not intended that the present invention be limited by the precise size of the 
plasmid. it is generally desirable that the recombinant plasmid have a total size of between 27 
and 40 kilobase pairs. It is preferred that the total size of the DNA packaged into an EAM 
30 derived from these recombinant plasmids is about the length of the wild-type adenovirus 

genome (-36 kb). It is well known in the an that DNA representing about 105% of the wild- 
type length may be packaged into a viral particle: thus the EAM derived from recombinant 
plasmid may contain DNA whose length exceeds by -105% the size of the wild-type genome. 
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The size of Die recombinant plasmid may be adjusted using reporter genes and genes of 
•merest having various sizes (including the use of different sizes of introns within these ,enes 
as well as through the use of irrelevant or non-coding DNA fragment which act as "stuffer" 
fragments {e.g.. ponions of bacteriophage genomes). 

In one embodiment of the recombinant plasmid. said 5" end of said packas.ng 
sequence is linked to said 3* end of said left ITR. ] n this embodiment, said first Bene of 
interest is linked to said 3" end of said packaging sequence. 1, is not intended that the present 
invention be limited by the nature of the gene of interest: a variety of genes (including both 
cDNA and genomic forms) are contemplated: any gene having therapeutic value may be 
inserted into the recombinant plasmids of the present invention. For example, the transfer of 
the adenosine deaminase (ADA) gene is useful for the treatment of ADA- patients: the 
transfer of the CFTR gene is useful for the treatment of cyanic fibrosis. A wide variety of 
diseases are known to be due to a defect in a single gene. The plasmids. vectors and EAMs 
of the present invention are useful for the transfer of a non-mutated form of a gene which is 
mutated in a patient thereby resulting in disease. The present invention is illustrated using 
recombinant plasmids capable of generating encapsidated adenovirus minichromosomes 
(EAMs) containing the dystrophin cDNA gene (the cDNA form of this gene is preferred due 
to the large size of this gene): the dystrophin gene is non-functional in muscular dvstrophv 
(MD) patients. However, the present invention is not limited toward the use of the dystrophin 
gene for treatment of MD: the use of the utrophin (also called the dystrophin related protein) 
gene is also contemplated for gene therapy for the treatment of MD [Tinsley e, al. (1993) 
Curr. Opin. Genet. Dev. 3:484 and (1992) Nature 360:591]: the utrophin gene protein has 
been reported to be capable of functionally substituting for the dystrophin gene [Tinsley and 
Davies ,1993) Neuromusc. Disord. 3:539]. As the utrophin gene product is expressed in the 
muscle of muscular dystrophy patients, no immune response would be directed against the 
utrophin gene product expressed in cells of a host (including a human) containing the 
recombinant plasmids. Ad vectors or EAMs of the present invention. While the present 
invention is illustrated using plasmids containing the dystrophin gene, the plasmids. Ad 
vectors and EAMs of the present invention have broad application for the transfer of any gene 
whose gene product is missing or altered in activity in cells. 

Embodiments are contemplated wherein the recombinant plasmid further comprises a 
second gene of interest. In one embodiment, said second gene of interest is linked to said 3" 
end of said right ITR. In one embodiment, said second gene of interest is a reporter gene. A 
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variety of reporter genes are contemplated, including but not limited to E. coli P-palactosidase 

gene, the human placental alkaline phosphatase gene, the green fluorescent protein gene and 
the chloramphenicol acetyltransferase gene. 

As mentioned above, the first approach also involves the use of a helper adenovirus in 

5 combination with the recombinant plasmid. In one embodiment, the present invention 

contemplates a helper adenovirus comprising i) first and a second loxP sequences, and ii) the 
adenovirus packaging sequence, said packaging sequence having a 5" and a 3' end. It is 
preferred that said first loxP sequence is linked to the 5' end of said packaging sequence and 
said second loxP sequence is linked to said 3 ? end of said packaging sequence. In one 

10 embodiment, the helper virus comprises at least one adenovirus gene coding region. 

The present invention contemplates a mammalian cell line containing the above- 
described recombinant plasmid and the above-described helper virus. It is preferred that said 
cell line is a 293-derived cell line. Specifically, in one embodiment, the present invention 
contemplates a mammalian cell line, comprising: a) a recombinant plasmid. comprising, in 

15 operable combination: i) a plasmid backbone, comprising an origin of replication, an 

antibiotic resistance gene and a eukaryotic promoter element, ii) the left and right inverted 
terminal repeats (ITRs) of adenovirus, said ITRs each having a 5' and a 3 ! end and arranged 
in a tail to tail orientation on said plasmid backbone, iii) the adenovirus packaging sequence, 
said packaging sequence having a 5' and a 3" end and linked to one of said ITRs. and iv) a 

20 first gene of interest operably linked to said promoter element: and b) a helper adenovirus 
comprising i) first and a second loxP sequences, and ii) the adenovirus packaging sequence, 
said packaging sequence having a 5* and a T end. As noted previously, said helper can 
further comprise at least one adenovirus gene coding region. 

Overall, the first approach allows for a method of producing an adenovirus 

25 minichromosome. In one embodiment, this method comprises: A) providing a mammalian 
cell line containing: a) a recombinant plasmid. comprising, in operable combination, i) a 
plasmid backbone, comprising an origin of replication, an antibiotic resistance gene and a 
eukaryotic promoter element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5 ? and a 3' end and arranged in a tail to tail orientation 

30 on said plasmid backbone, iii) the adenovirus packaging sequence, said packaging sequence 
having a 5* and a 3* end and linked to one of the ITRs. and iv) a first gene of interest 
operably linked to said promoter element; and b) a helper adenovirus comprising i) first and a 
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second la.xP sequences, ii) at least one adenovirus gene coding region, and iii) ,h e adenovirus 
packaging sequence, said packaging sequence having a 5' and a 3' end: and 
B) growing said cell line under conditions such that said adenovirus gene coding region is 
expressed and said recombinant plasmid directs the production of at least one adenoviral 
minichromosome. It is desired that said adenovirus minichromosome is encapsidated. 

In one embodiment, the present invention contemplates recovering said encapsidated 
adenovirus minichromosome and. in turn, purifying said recovered encapsidated adenovirus 
minichromosome. Thereafter, said purified encapsidated adenovirus minichromosome can be 
administered to a host (e.g., a mammal). Human therapy is thereby contemplated. 

It is not intended that the present invention be limited by the nature of the 
administration of said minichromosomes. All types of administration are contemplated, 
including direct injection (intra-muscular. intravenous, subcutaneous, etc.). inhalation, etc. 

As noted above, the present invention contemplates a second approaches to improving 
adenovirus vectors. In the second approach, "damaged" adenoviruses are employed. In one 
embodiment, the present invention contemplates a recombinant adenovirus comprising the 
adenovirus E2b region having a deletion, said adenovirus capable of self-propagation in a 
packaging cell line and said E2b region comprising the DNA polymerase gene and the 
adenovirus preterminal protein gene. In this embodiment, said deletion can be within the 
adenovirus DNA polymerase gene. Alternatively, said deletion is within the adenovirus 
preterminal protein gene. Finally, the present invention also contemplates embodiments 
wherein said deletion is within the adenovirus DNA polymerase and preterminal protein 
genes. 

The present invention further provides cell lines capable of supporting the propagation 
of Ad virus containing deletions within the E2b region. In one embodiment the invention 
provides a mammalian cell line stably and constitutively expressing the adenovirus El gene 
products and the adenovirus DNA polymerase. In one embodiment, these cell lines comprise 
a recombinant adenovirus comprising a deletion within the E2b region, this E2b-deleted 
recombinant adenovirus being capable of self-propagation in the cell line. The present 
invention is not limited by the nature of the deletion within the E2b region. In one 
embodiment, the deletion is within the adenoviral DNA polymerase gene. 

The present invention provides cells lines stably expressing El proteins and the 
adenoviral DNA polymerase, wherein the genome of the cell line contains a nucleotide 
sequence encoding adenovirus DNA polymerase operably linked to a heterologous promoter. 
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In a particularly preferred embodiment, the cell line is selected from the group consisting of 
the B-6. B-9. C-l. C-4. C-7. C-13. and C-14 cell lines. 

The present invention further provides cell lines which further constitutively express 
the adenovirus preterminal protein (pTP) gene product (in addition to El proteins and DNA 
5 polymerase). In one embodiment, these pTP-expressing cell lines comprise a recombinant 
adenovirus comprising a deletion within the E2b region, the recombinant adenovirus being 
capable of self-propagation in the pTP-expressing cell line. In a preferred embodiment, the 
deletion within the E2b region comprises a deletion within the adenoviral preterminal protein 
gene. In another preferred embodiment, the deletion within the E2b region comprises a 

10 deletion within the adenoviral (Ad) DNA polymerase and preterminal protein genes. 

In a preferred embodiment, the cell lines coexpressing pTP and Ad DNA polymerase, 
contain within their genome, a nucleotide sequence encoding adenovirus preterminal protein 
operably linked to a heterologous promoter. In the invention is not limited by the nature of 
the heterologous promoter chosen. The art knows well how to select a suitable heterologous 

15 promoter to achieve expression in the desired host cell 293 cells or derivative thereof). 
In a particularly preferred embodiment, the pTP- and Ad polymerase-expressing cell line is 
selected from the group consisting of the C-l, C-4. C-7. C-13. and C-14 cell lines. 

The present invention provides a method of producing infectious recombinant 
adenovirus particles containing an adenoviral genome containing a deletion within the E2b 

20 region, comprising: a) providing: i) a mammalian cell line stably and constitutively 
expressing the adenovirus El gene products and the adenovirus DNA polymerase: ii) a 
recombinant adenovirus comprising a deletion within the E2b region, the recombinant 
adenovirus being capable of self-propagation in said cell line: b) introducing the recombinant 
adenovirus into the cell line under conditions such that the recombinant adenovirus is 

25 propagated to form infectious recombinant adenovirus particles: and c) recovering the 

infectious recombinant adenovirus particles. In a preferred embodiment, the method further 
comprises d) purifying the recovered infectious recombinant adenovirus particles. In yet 
another preferred embodiment, the method further comprises e) administering the purified 
recombinant adenovirus particles to a host which is preferably a mammal and most preferablv 

30 a human. 

In another preferred embodiment the mammalian cell line employed in the above 
method further constitutively expresses the adenovirus preterminal protein. 
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The present invention further provides a recombinant plasmid capable of replicating in 
a bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the polymerase gene, the deletion resulting in reduced polymerase activity. The 
present invention is not limited by the specific deletion employed to reduce polymerase 
activity. In a preferred embodiment, the deletion comprises a deletion of nucleotides 8772 to 
9385 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 
designation pApol. In another preferred embodiment, the recombinant plasmid has the 
designation pBHGHApol. 

The present invention also provides a recombinant plasmid capable of replicating in a 
bacterial host comprising adenoviral E2b sequences, the E2b sequences containing a deletion 
within the preterminal protein gene, the deletion resulting in the inability to express functional 
preterminal protein without disruption of the VA RNA genes. The present invention is not 
limited by the specific deletion employed to render the pTP inactive: any deletion within the 
pTP coding region which does not disrupt the ability to express the Ad VA RNA genes may 
15 be employed. In a preferred embodiment, the deletion comprises a deletion of nucleotides 
10.705 to 1 1.134 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid 
has the designation pApTP. In another preferred embodiment, the recombinant plasmid has 
the designation pBHGHApTP. 

In a preferred embodiment, the recombinant plasmid containing a deletion with the 
20 pTP region further comprises a deletion within the polymerase gene, this deletion resulting in 
reduced (preferably absent) polymerase activity. The present invention is not limited by the 
specific deletion employed to inactivate the polymerase and pTP genes. In a preferred 
embodiment, the deletion comprises a deletion of nucleotides 8.773 to 9586 and 11.067 to 
12.513 in SEQ ID NO:4. In one preferred embodiment, the recombinant plasmid has the 
25 designation P AXBApolApTPVARNA+tl3. In another preferred embodiment, the recombinant 
plasmid has the designation pBHGl lApolApTPVARNA+tl3. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1A is a schematic representation of the Ad polymerase expression plasmid 
30 pRSV-pol indicating that the Ad2 DNA polymerase sequences are under the transcriptional 
control of the RSV-LTR/promoter element and are flanked on the 3' end by the SV-40 small 
t intron and SV-40 polyadenylation addition site. 
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Figure IB is a schematic representation of the expression plasmid pRSV-pTP 
indicating that the Ad2 preterminal protein sequences are under the transcriptional control of 
the RSV-LTR/promoter element and are Hanked on the 3" end by the SV-40 small-t intron 
and SV-40 poiyadenylation signals. 

Figure 2 is an ethidium bromide-stained gel depicting the presence of Ad pol DNA 
sequences in genomic DNA from LP-293 cells and several hygromycin-resistant cell lines. 
The -750 bp PCR products are indicated by the arrow. 

Figure 3 is an autoradiograph depicting the results of a viral replication- 
complementation assay analyzing the functional activity of the Ad polymerase protein 
10 expressed by LP-293 cells and several hygromycin resistant cell lines. The 8.010 bp HindlU 
fragments analyzed by densitometry are indicated by an arrow. 

Figure 4 is an autoradiograph indicating that cell lines B-6 and C-7 contained a 
smaller and a larger species of Ad polymerase mRNA while LP-293 derived RNA had no 
detectable hybridization signal. The location of the two species of Ad polymerase mRNA are 
15 indicated relative to the 28S and 18S ribosomal RNAs. and the aberrant transcript expressed 
by the B-9 cell line is indicated by an arrow. 

Figure 5 is an autoradiograph showing which of the cell lines that received the 
preterminal protein expression plasmid indicated the presence of pRSV-pTP sequences (arrow 
labelled M pTP") and El sequences (arrow labelled "El"). 
20 Figure 6 is an autoradiograph indicating in which of the cell lines transcription of 

preterminal protein is occurring (arrow labelled 3kb"). 

Figure 7 is an autoradiograph indicating that the expression of the Ad-polymerase 
could overcome the replication defect of USsub\00 at non-permissive temperatures. 

Figure 8 graphically depicts plaque titre for LP-293, B-6 and C-7 cell lines infected 
25 with wtAd5. H5ts36. or H5.yw6100. and the results demonstrate that the C-7 ceil line can be 
used as a packaging cell line to allow the high level growth of El. preterminal, and 
polymerase deleted Ad vectors. 

Figure 9 is an autoradiograph showing that the recombinant pol* virus is viable on 
pol-expressing 293 cells but not on 293 cells which demonstrates that recombinant Ad viruses 
30 containing the 612 bp deletion found within pApol lack the ability to express Ad polymerase. 

Figure 10 is a schematic representation of the structure of pAdSfidys wherein the two 
inverted adenovirus origins of replication are represented by a left and right inverted terminal 
repeat (LITR/RJTR). PI and P2 represent location of probes used for Southern blot analysis. 

- 8 - 
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Figure 1 1 graphically illustra.es the total number of transducing adenovirus panicles 
produced (output) per serial passage on 293 cells, total input virus of either the helper (hpAP) 
or AdSPdys. and the total number of cells used in each infection. 

Figures 12A-B show Southern blot analyses of viral DNA from lysates 3. 6. 9 and 12. 
digested with the restriction enzymes BssHll. Nru\ and £«,RV. For the analyses, fragments" 
from the C terminus of mus musculus dystrophin cDNA (A) or the N terminus of E coli p. 
galactosidase (B) were labeled with dCTP 32 and used as probes. 

Figures 13A-B show the physical separation of Ad5pdys from hpAP virions at the 
third (A) and final (B) stages of CsCl purification. 

Figure 14 graphically depicts the level of contamination of Ad5pdys EAMs by hpAP 
virions obtained from the final stage of CsCl purification as measured by P-galactosidase and 
alkaline phosphatase expression. The ratio of the two types of virions - Ad5pdys EAMs 
(LacZ) or hpAP (AP) in each fraction is indicated in the lower graph. 

Figures 15A-B are western blot (immunoblot) analyses of protein extracts from mdx 
myoblasts and myotubes demonstrating the expression of P-galactosidase (A) and dystrophin 
(B) in cells infected with Ad5pdys EAMs. 

Figures 16A-C depict immunofluorescence of dystrophin expression in wild type 
MM 1 4 myotubes (A), uninfected mdx (B) and infected mdx myotubes (C). 

Figure 17 is a schematic representation of the MCK/lacZ constructs tested to determine 
what portion of the - 3.3 kb DNA fragment containing the enhancer/promoter of the MCK 
gene is capable of directing high levels of expression of linked genes in muscle cells. 

Figure 18 is a schematic representation of a GFP/p-gaJ reporter construct suitable for 
assaying the expression of Cre recombinase in mammalian cells. 

Figure 19 is a schematic representation of the recombination event between the loxP 
shuttle vector and the Ad5d77001 genome. 

DEFINITIONS 

To facilitate understanding of the invention, a number of terms are defined below. 

The term "gene" refers to a DNA sequence that comprises control and coding 
sequences necessary for the production of a polypeptide or precursor thereof. The polypeptide 
can be encoded by a full length coding sequence or by any portion of the coding sequence so 
long as the desired enzymatic activity is retained. The term "gene" encompasses both cDNA 
and genomic forms of a given gene. 
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The term "wild-type" refers to a gene or gene product which has the characteristics of 
that gene or gene product when isolated from a naturally occurring source. A wild-type gene 
is that which is most frequently observed in a population and is thus arbitrarily designated the 
"normal" or "wild-type" form of the gene. In contrast, the term "modified" or "mutant" refers 
to a gene or gene product which displays modifications in sequence and or functional 
properties (i.e.. altered characteristics) when compared to the wild-type gene or gene product. 
It is noted that naturally-occurring mutants can be isolated; these are identified by the fact that 
they have altered characteristics when compared to the wild-type gene or gene product. 

The term "oligonucleotide" as used herein is defined as a molecule comprised of two 
or more deoxyribonucleotides or ribonucleotides, usually more than three (3). and typically 
more than ten (10) and up to one hundred (100) or more (although preferably between twentv 
and thirty). The exact size will depend on many factors, which in turn depends on the 
ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in any 
manner, including chemical synthesis. DNA replication, reverse transcription, or a 
combination thereof. 

As used herein, the term "regulatory element" refers to a genetic element which 
controls some aspect of the expression of nucleic acid sequences. For example, a promoter is 
a regulatory element which facilitates the initiation of transcription of an operably linked 
coding region. Other regulatory elements are splicing signals, polyadenylation signals, 
termination signals, etc. (defined infra). 

Transcriptional control signals in eucaryotes comprise "promoter" and "enhancer" 
elements. Promoters and enhancers consist of short arrays of DNA sequences that interact 
specifically with cellular proteins involved in transcription [Maniatis. T. ei ai. Science 
236:1237 (1987)]. Promoter and enhancer elements have been isolated from a variety of 
eukaryotic sources including genes in yeast, insect and mammalian cells and viruses 
(analogous control elements, i.e., promoters, are also found in procaryotes). The selection of 
a particular promoter and enhancer depends on what cell type is to be used to express the 
protein of interest. Some eukaryotic promoters and enhancers have a broad host range while 
others are functional in a limited subset of cell types [for review see Voss. S.D. ei ai. Trends 
Biochem. Sci.. 1 1:287 (1986) and Maniatis. T. el ai. supra (1987)]. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired coding sequence and appropriate DNA sequences necessary for the 
expression of the operably linked coding sequence in a particular host organism (e.g. 
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mammal). DNA sequences necessary for expression in procaryotes include a promo.er. 
optionally an operator sequence, a ribosome binding sue and possibly other sequences. 
Eukaryotic cells are known to utilize promoters, polyadenlyation signals and enhancers. 

The terms "in operable combination", "in operable order" and "operablv linked" as 
used herein refer to the linkage of nucleic acid sequences in such a manner that a nucleic and 
molecule capable of directing the transcription of a given gene and/or the synthesis of a 
desired protein molecule is produced. The term also refers to the linkage of amino acid 
sequences in such a manner so that a functional protein is produced. 

The term "genetic cassette" as used herein refers to a fragment or segment of DNA 
containing a particular grouping of genetic elements. The cassette can be removed and 
inserted into a vector or plasmid as a single unit. A plasmid backbone refers to a piece of 
DNA containing at least plasmid origin of replication and a selectable marker uene (e.g.. an 
antibiotic resistance gene) which allows for selection of bacterial hosts containing the plasmid: 
the plasmid backbone may also include a polylinker region to facilitate the insertion of 
genetic elements within the plasmid. When a particular plasmid is modified to contain non- 
plasmid elements <«.*.. insertion of Ad sequences and/or a eukaryotic gene of interest linked 
to a promo.er element), the plasmid sequences are referred to as the plasmid backbone. 

Because mononucleotides are reacted to make oligonucleotides in a manner such that 
the 5" phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of its 
neighbor in one direction via a phosphodiester linkage, an end of an oligonucleotide is 
referred to as the "5" end" if its 5" phosphate is not linked to the 3" oxygen of a 
mononucleotide pentose ring and as the "3" end" if its 3" oxygen is not linked to a 5" 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucle.c acid 
sequence, even if internal to a larger oligonucleotide, also may be said to have 5" and 3" ends. 

When two different, non-overlapping oligonucleotides anneal to different regions of 
the same linear complementary nucleic acid sequence, and the 3' end of one oligonucleotide 
points towards the 5" end of the other, the former may be called the "upstream" 
oligonucleotide and the latter the "downstream" oligonucleotide. 

The term "primer" refers to an oligonucleotide which is capable of acting as a point of 
initiation of synthesis when placed under conditions in which primer extension is initiated. 
An oligonucleotide "primer" may occur naturally, as in a purified restriction digest or may be 
produced synthetically. 
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A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize with a 
template strand for primer elongation to occur. A primer sequence need not reflect the exact 
sequence of the template. For example, a non-complementary nucleotide fragment may be 
5 attached to the 5* end of the primer, with the remainder of the primer sequence being 

substantially complementary to the strand. Non-complementary bases or longer sequences can 
be interspersed into the primer, provided that the primer sequence has sufficient 
complementarity with the sequence of the template to hybridize and thereby form a template 
primer complex for synthesis of the extension product of the primer. 

10 "Hybridization" methods involve the annealing of a complementary sequence to the 

target nucleic acid (the sequence to be detected). The ability of two polymers of nucleic acid 
containing complementary sequences to find each other and anneal through base pairing 
interaction is a well-recognized phenomenon. The initial observations of the "hybridization" 
process by Marmur and Lane. Proc. Natl Acad. Scl USA 46:453 (1960) and Doty at al, 

15 Proc. Xatl. Acad Sci. USA 46:461 (1960) have been followed by the refinement of this 
process into an essential tool of modern biology. 

The complement of a nucleic acid sequence as used herein refers to an oligonucleotide 
which, when aliened with the nucleic acid sequence such that the 5" end of one sequence is 
paired with the 3* end of the other, is in "antiparallel association." Certain bases not 

20 commonly found in natural nucleic acids may be included in the nucleic acids of the present 
invention and include, for example, inosine and 7-deazaguanine. Complementarity need not 
be perfect: stable duplexes may contain mismatched base pairs or unmatched bases. Those 
skilled in the art of nucleic acid technology can determine duplex stability empirically 
considering a number of variables including, for example, the length of the oligonucleotide. 

25 base composition and sequence of the oligonucleotide, ionic strength and incidence of 
mismatched base pairs. 

Stability of a nucleic acid duplex is measured by the melting temperature, or "T m ." 
The T m of a particular nucleic acid duplex under specified conditions is the temperature at 
which on average half of the base pairs have disassociated. The equation for calculating the 

30 T m of nucleic acids is well known in the art. 

The term "probe" as used herein refers to a labeled oligonucleotide which forms a 
duplex structure with a sequence in another nucleic acid, due to complementarity of at least 
one sequence in the probe with a sequence in the other nucleic acid. 
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The term "label" as used herein refers to any aiom or molecule which can be used 10 
provide a detectable (preferably quantifiable) signal, and which can be attached to a nucleic- 
acid or protein. Labels may provide signals detectable by fluorescence, radioactivity, 
colorimetry. gravimetry. X-ray diffract.on or absorption, magnetism, enzymatic activity, and 
the like. 

The terms "nucleic acid substrate" and nucleic acid template" are used herein 
interchangeably and refer to a nucleic acid molecule which may comprise single- or double- 
stranded DNA or RNA. 

"Oligonucleotide primers matching or complementary to a gene sequence" refers to 
oligonucleotide primers capable of facilitating the template-dependent synthesis of single or 
double-stranded nucleic acids. Oligonucleotide primers matching or complementary to a gene 
sequence may be used in PCRs. RT-PCRs and the like. 

A "consensus gene sequence" refers to a gene sequence which is derived by 
comparison of two or more gene sequences and which describes the nucleotides most often 
present in a given segment of the genes: the consensus sequence is the canonical sequence. 

The term "polymorphic locus" is a locus present in a population which shows variation 
between members of the population (/.,.. the most common allele has a frequency of less than 
0.95). In contrast, a "monomorphic locus" is a genetic locus at little or no variations seen 
between members of the population (generally taken to be a locus at which the most common 
allele exceeds a frequency of 0.95 in the gene pool of the population). 

The term "microorganism" as used herein means an organism too small to be observed 
with the unaided eye and includes, but is not limited to bacteria, muses, protozoans, fungi, 
and ciliates. 

The term "microbial gene sequences" refers to gene sequences derived from a 
microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic. intracellular parasites incapable 
of autonomous replication {i.e.. replication requires the use of the host cell's machinery). 
Adenoviruses, as noted above, are double-stranded DNA v,ruses. The left and right inverted 
terminal repeats (ITRs) are short elements located at the 5" and 3" termini of the linear Ad 
genome, respectively and are required for replication of the viral DNA. The left ITR is 
located between 1-130 bp in the Ad genome (also referred to as 0-0.5 mu). The right ITR is 



- 13 - 



WO 98/17783 PCT7US97/19541 

located from -3.7500 bp to the end of the genome (also referred to as 99.5-100 mu). The two 

ITRs are inverted repeats of each other. For clarity, the left ITR or 5* end is sued define the 
5* and 3* ends of the ITRs. The 5* end of the left ITR is located at the extreme 5' end of the 
linear adenoviral genome; picturing the left ITR (LITR) as an arrow extending from the 5* 
end of the genome, the tail of the 5* ITR is located ai mu 0 and the head of the left ITR is 
located at - 0:5 mu (further the tail of the left ITR is referred to as the 5* end of the left ITR 
and the head of the left ITR is referred to as the 3* end of the left ITR). The tail of the right 
or 3* ITR is located at mu 100 and the head of the rigt ITR is located at - mu 99.5: the head 
of the right ITR is referred to as the 5' end of the right ITR and the tail of the right ITR is 
referred to as the 3" end of the right ITR (RJTR). In the linear Ad genome, the ITRs face 
each other with the head of each ITR pointing inward toward the bulk of the genome. When 
arranged in a "tail to tail orientation" the tails of each ITR (which comprise the 5' end of the 
LITR and the 3* end of the RJTR) are located in proximity to one another while the heads of 
each ITR are separated and face outward (see for example, the arrangement of the ITRs in the 
EAM shown in Figure 10 herein). The "adenovirus packaging sequence" refers to the 4* 
sequence which comprises five (AI-AV) packaging signals and is required for encapsidation 
of the mature linear genome: the packaging signals are located from -194 to 358 bp in the Ad 
genome (about 0.5-1.0 mu). 

The phrase "at least one adenovirus gene coding region" refers to a nucleotide 
sequence containing more than one adenovirus gene coding gene. A "helper adenovirus" or 
"helper virus" refers to an adenovirus which is replication-competent in a particular host cell 
(the host may provide Ad gene products such as El proteins), this replication-competent virus 
is used to supply m irems functions (e.g.. proteins) which are lacking in a second replication- 
incompetent virus: the first replication-competent virus is said to "help" the second 
replication-incompetent virus thereby permitting the propagation of the second viral genome 
in the cell containing the helper and second viruses. 

The term "containing a deletion within the E2b region" refers to a deletion of at least 
one basepair (preferably more than one bp and preferably at least 100 and most preferably 
more than 300 bp) within the E2b region of the adenovirus genome. An E2b deletion is a 
deletion that prevents expression of at least one E2b gene product and encompasses deletions 
within exons encoding portions of E2b-specific proteins as well as deletions within promoter 
and leader sequences. 
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An adenovtrus minichromosome" refers to a linear molecule of DNA containing the 

Ad ITRs on each end which is generated from a pJasmid containing the ITRs and one or more 
gene of interest. The term "encapsidmed adenovirus minichromosome" or "EAM" refers to an 
adenovirus minichromosome which has been packaged or encapsidated into a viral particle' 
plasmids containing the Ad ITRs and the packaging stgnai are shown herein to produce 
EAMs. When used herein, "recovering" encapsidated adenovirus minichromosomes refers to 
the collection of EAMs from a cell containing an EAM plasmid and a helper virus: this cell 
will direct the encapsidation of the minichromosome to produce EAMs. The EAMs may be 
recovered from these cells by lysis of the cell (,*.. freeze-thawing) and pelleting of the'ce.l 
debns to a eel) extract as described in Example 1 (Ex. 1 describes the recovery of Ad virus 
from a cell, but the same technique is used to recover EAMs from a cell). "Purifying" such 
minichromosomes refers to the isolation of the recovered EAMs in a more concentrated form 
(relative to the cell iysate) on a density gradient as described in Example 7: purification of 
recovered EAMs permits the physical separation of the EAM from any helper virus (if 
present). 

The term "transfection" as used herein refers to the introduction of foreign DNA into 
eukaryotic cells. Transfection may be accomplished by a variety of means known to the art 
including calcium phosphate-DNA co-precipitation. DEAE-dextran-mediated transfection. 
polybrene-mediated transfection. electroporation. microinjection, liposome fusion, iipofection. 
protoplast fusion, retroviral infection, and biolistics. 

The term "stable transfection" or "stably transfected" refers to the introduction and 
integration of foreign DNA into the genome of the transfected cell. The term "stable 
transfectant" refers to a cell which has stably integrated foreign DNA into the genomic DNA. 

As used herein, the term "gene of interest" refers to a gene inserted into a vector or 
plasmid whose expression is desired in a host cell. Genes of interest include genes having 
therapeutic value as well as reporter genes. A variety of such genes are contemplated, 
including genes of interest encoding a protein which provides a therapeutic function (such as 
the dystrophin gene, which is capable of correcting the defect seen in the muscle of MD 
patients), the utrophin gene, the CFTR gene (capable of correcting the defect seen in cystitic 
fibrosis patients), etc.. 

The term "reporter gene" indicates a gene sequence that encodes a reporter molecule 
(including an enzyme). A "reporter molecule" is detectable in any detection system, 
including, but not limited to enzyme (e.g., ELISA. as well as enzyme-based histochemical 
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assays), fluorescent, radioactive, and luminescent systems. In one embodiment, the present 

invention contemplates the E. coli P-galactosidase gene (available from Pharmacia Biotech. 
Pistacataway. NJ). green fluorescent protein (GFP) (commercially available from Clontech. 
Palo Alto. CA). the human placental alkaline phosphatase gene, the chloramphenicol 
5 acetyltransferase (CAT) gene: other reporter genes are known to the art and may be 
employed. 

As used herein, the terms "nucleic acid molecule encoding. " M DNA sequence 
encoding." and "DNA encoding" refer to the order or sequence of deoxyribonucleotides along 
a strand of deoxyribonucleic acid. The order of these deoxyribonucleotides determines the 
10 order of amino acids along the polypeptide (protein) chain. The DNA sequence thus codes 
for the amino acid sequence. 

DESCRIPTION OF THE INVENTION 

The present invention provides improved adenovirus vectors for the delivery of 

15 recombinant genes to cells in vitro and in vivo. As noted above, the present invention 

contemplates two approaches to improving adenovirus vectors. The first approach generallv 
contemplates a recombinant plasmid containing the minimal region of the Ad genome 
required for replication and packaging (i.e.. the left and right 1TR and the packaging or ¥ 
sequence) along with one or more genes of interest; this recombinant plasmid is packaged into 

20 an encapsidated adenovirus minichromosome (EAM) when grown in parallel with an El- 
deleted helper virus in a cell line expressing the El proteins (e.g.. 293 cells). The 
recombinant adenoviral minichromosome is preferentially packaged. To prevent the 
packaging of the helper virus, a helper virus containing loxP sequences flanking the ¥ 
sequence is employed in conjunction with 293 cells expressing Cre recombinase: Cre-IoxP 

25 mediated recombination removes the packaging sequence form the helper genome thereby 

preventing packaging of the helper during the production of EAMs. In the second approach, 
"damaged" or "deleted" adenoviruses containing deletions within the E2b region are 
employed. While the "damaged" adenovirus is capable of self-propagation in a packaging cell 
line expressing the appropriate E2b protein(s), the E2b-deleted recombinant adenovirus are 

30 incapable of replicating and expressing late viral gene products outside of the packaging cell 
line. 

In one embodiment, the self-propagating recombinant adenoviruses containing 
deletions in the E2b region of the adenovirus genome. In another embodiment, "gutted" 
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v.ruses are contemplated: these viruses lack all viral coding region, In addition, packaging 

cell h„es co-expressing El and E2b gene products are provided which allow the product.on of 

rn.ect.ous recombinant virus containing deletions in the El and E2b regions without the use 

of helper virus. 

The Description of the Invention is divided into the following sections: I Self- 
Propagating Adenovirus Vectors: II. Packaging Cell Lines: and III. Encapsidated Adenoviral 

Minichromosomes. 



I. Self-Propagating Adenovirus .Vectors 

Self-propagating adenovirus (Ad) vectors have been extensively utilized to deliver 
foreign genes .o a great variety of cell types in vitro and in vivo. "Self-propagatina viruses- 
are those which can be produced by transfection of a single piece of DNA (the recombinant 
viral genome) into a single packaging cell line to produce infectious virus: self-propagating 
viruses do not require the use of helper virus for propagation. 

Existing Ad vectors have been shown ,0 be problematic /„ vivo. This is due in pan 
because current or first generation Ad vectors are deleted for only the early recion I (El) 
genes. These vectors are crippled in their ability to replicate normally without ,he trans- 
complementation of El functions provided by human 293 cells, a packaging cell line [ATCC 
CRL 1573: Graham « al. (1977) J. Gen. Virol. 36:59]. Unfortunately, with the use of high 
litres of El deleted vectors, and the fact that there are El-like factors present in many cell 
types. El deleted vectors can overcome the block to replication and express other viral gene 
products flmperiale et al. (1984) Mol. Cell Biol. 4:867: Nevins (1981) Cell 26:213: and 
Gaynor and Berk (.983) Cell 33:683], The expression of viral proteins in the infected target 
cells elicits a swift host immune response, that is largely T-cell mediated [Yang and Wilson 
(1995) J. Immunol. 155:2564 and Yang e, al. (1994) Proc. Natl. Acad. Sci. USA 91:4407]. 
The transduced cells are subsequently eliminated, along with the transferred foreign gene. In 
immuno-incompetent animals. Ad delivered genes can be expressed for periods of up" to one 
year [Yang « al. (1994). supra: Vincent et al. (1993) Nature Genetics 5:130: and Yang et al. 
(1995) Proc. Natl. Acad. Sci. USA 92:7257]. 

Another shortcoming of first generation Ad vectors is that a single recombination 
event between the genome of an Ad vector and the integrated El sequences present in 293 
cells can generate replication competent Ad (RCA), which can readily contaminate viral 



stocks. 
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In order to further cripple viral protein expression, and also to decrease the frequency 

of generating RCA. the present invention provides Ad vectors containing deletions in the E2b 
region. Propagation of these E2b-deleted Ad vectors requires cell lines which express the 
deleted E2b gene products. The present invention provides such packaging cell lines and for 

5 the first time demonstrates that the E2b gene products. DNA polymerase and preterminal 

protein, can be constitutively expressed in 293 cells along with the El gene products. With 
every gene that can be constitutively expressed in 293 cells comes the opportunity to generate 
new versions of Ad vectors deleted for the respective genes. This has immediate benefits; 
increased carrying capacity, since the combined coding sequences of the polymerase and 

10 preterminal proteins that can be theoretically deleted approaches 4.6 kb and a decreased 

incidence of RCA generation, since two or more independent recombination events would be 
required to generate RCA. Therefore, the novel El. Ad polymerase and preterminal protein 
expressing cell lines of the present invention enable the propagation of Ad vectors with a 
carrying capacity approaching 13 kb. without the need for a contaminating helper virus 

15 [Mitani et til. (1995) Proc. Natl. Acad. Sci. USA 92:3854]. In addition, when genes critical 
to the viral life cycle are deleted (e.g., the E2b genes), a further crippling of Ad to replicate 
and express other viral gene proteins occurs. This decreases immune recognition of virally 
infected cells, and allows for extended durations of foreign gene expression. The most 
important attribute of El, polymerase, and preterminal protein deleted vectors, however, is 

20 their inability to express the respective proteins, as well as a predicted lack of expression of 
most of the viral structural proteins. For example, the major late promoter (MLP) of Ad is 
responsible for transcription of the late structural proteins LI through L5 [Doerfler. In 
Adenovirus DNA, The Viral Genome and lis Expression (Martinus Nijhoff Publishing Boston. 
1986)]. Though the MLP is minimally active prior to Ad genome replication, the rest of the 

25 late genes get transcribed and translated from the MLP only after viral genome replication has 
occurred [Thomas and Mathews (1980) Cell 22:523]. This cis-dependent activation of late 
yene transcription is a feature of DNA viruses in general, such as in the growth of polyoma 
and SV-40. The polymerase and preterminal proteins are absolutely required for Ad 
replication (unlike the E4 or protein IX proteins) and thus their deletion is extremely 

30 detrimental to Ad vector late gene expression. 
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11. Packaging Cell Lines Constitiitivcly Expressing E2b Gene 
Products 

The present invention addresses the limitations of current or first generation Ad 
vectors by isolating novel 293 cell lines coexpressing critical viral gene functions. The 
present invention describes the isolation and characterization of 293 cell lines capable of 
const.tutively expressing the Ad polymerase protein. In addition, the present invention 
describes the isolation of 293 cells which not only express the El and polymerase proteins, 
but also the Ad-preterminal protein. The isolation of cell lines coexpressing the El. Ad 
polymerase and preterminal proteins demonstrates that three genes critical to the life cycle of 
Ad can be constitutively coexpressed. without toxicity. 

In order to delete critical genes from self-propagating Ad vectors, the proteins encoded 
by the targeted genes have to first be coexpressed in 293 cells along with the El proteins. 
Therefore, only those proteins which are non-toxic when coexpressed constitutively (or toxic 
proteins inducibly-expressed) can be utilized. Coexpression in 293 cells of the El and E4 
genes has been demonstrated (utilizing inducible, not constitutive, promoters) [Yeh et al. 
(1996) J. Virol. 70:559: Wang e, al. (1995) Gene Therapy 2:775: and Gorziglia et al. (1996) 
J. Virol. 70:4173]. The El and protein IX genes (a virion structural protein) have been 
coexpressed fCaravokyri and Leppard (1995) J. Virol. 69:6627]. and coexpression of the El.. 
E4. and protein IX genes has also been described [Krougliak and Graham (1995) Hum. Gene 
Ther. 6:1575]. 

The present invention provides for the first time, cell lines coexpressing El and E2b 
gene products. The E2b region encodes the viral replication proteins which are absolutely 
required for Ad genome replication fDoerfler. supra and Pronk el al. (1992) Chromosoma 
102:S39-S45]. The present invention provides 293 cells which constitutively express the 140 
kD Ad-poiymerase. While other researchers have reported the isolation of 293 cells which 
express the Ad-preterminal protein utilizing an inducible promoter [Schaack et al. (1995) J. 
Virol. 69:4079], the present invention is the first to demonstrate the high-level, constitutive 
coexpression of the El. polymerase, and preterminal proteins in 293 cells, without toxicity. 
These novel cell lines permit the propagation of novel Ad vectors deleted for the El, 
polymerase, and preterminal proteins. 
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III. Encapsidnted Adenoviral Minichromosomes 

The present invention also provides encapsidated adenovirus minichromosome (EAM) 
consisting of an infectious encapsidated linear genome containing Ad origins of replication, 
packaging signal elements, a reporter gene (e.g.. a P-galactosidase reporter gene cassette) and 

5 a gene of interest (e.g.. a full length (14 kb) dystrophin cDNA regulated by a muscle specific 
enhancer/promoter). EAMs are generated by cotransfecting 293 cells with supercoiled 
plasmid DNA (e.g.. pAd5(Jdys) containing an embedded inverted origin of replication (and the 
remaining above elements) together with linear DNA from El -deleted virions expressing 
human placental alkaline phosphatase (hpAP) (a helper virus). All proteins necessary for the 

10 generation of EAMs are provided in trans from the hpAP virions and the two can be 

separated from each other on equilibrium CsCl "gradients. These EAMs are useful for gene 
transfer to a variety of cell types both in vitro and in vivo. 

Adenovirus-mediated gene transfer to muscle is a promising technology for gene 
therapy of Duchenne muscular dystrophy (DMD). However, currently available recombinant 

15 adenovirus vectors have several limitations, including a limited cloning capacity of -8.5 kb. 
and the induction of a host immune response that leads to transient gene expression of 3 to 4 
weeks in immunocompetent animals. Gene therapy for DMD could benefit from the 
development of adenoviral vectors with an increased cloning capacity to accommodate a full 
length (-14 kb) dystrophin cDNA. This increased capacity should also accommodate gene 

20 regulatory elements to achieve expression of transduced genes in a tissue-specific manner. 
Additional vector modifications that eliminate adenoviral genes, expression of which is 
associated with development of a host immune response, might greatly increase long term 
expression of virally delivered genes in vivo. The constructed encapsidated adenovirus 
minichromosomes of the present invention are capable of delivering up to 35 kb of non-viral 

25 exogenous DNA. These minichromosomes are derived from bacterial plasmids containing 
two fused inverted adenovirus origins of replication embedded in a circular genome, the 
adenovirus packaging signals, a Pgalactosidase reporter gene and a full length dystrophin 
cDNA regulated by a muscle specific enhancer/promoter. The encapsidated minichromosomes 
are propagated in viiro by /ra/7.v-complementation with a replication defective (E1+E3 deleted) 

30 helper virus. These minichromosomes can be propagated to high titer (>10 8 /ml) and purified 
on CsCl gradients due to their buoyancy difference relative to helper virus. These vectors are 
able to transduce myogenic cell cultures and express dystrophin in myotubes. These results 
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demonstrate that encapsidated adenovirus minichromosomes are useful for gene transfer to 

muscle and other tissues. 

The present invention further provides methods for modifying the above-described 
EAM S v,, em l0 enable the generation of high tner stocks of EAMs with mmimal helper virus 
comammaiion. Preferably the EAM stocks contain helper virus representing less than |%. 
preferably less than 0.1% and most preferably less than 0.01% (including 0.0%) of the final 

viral isolate. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The firs, is by selectively controlling the relative packaging efficiency of the helper virus 
versus the EAM virus. The Cre-loxP excision method is employed to remove the packaging 
signals trom the helper virus thereby preventing the packaging of the helper virus used to 
provide in trans viral proteins for the encapsidation of the recombinant adenovirus 
minichromosomes. The second approach to reducing or eliminating helper virus in EAM 
stocks is the use of improve physical methods for separating EAM from helper virus. 

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects 
of the present invention and are not to be construed as limiting the scope thereof. 

In the experimental disclosure which follows, the following abbreviations apply: M 
(molar): mM (millimolar): uM (micromolar): mol (moles); mmol (millimoles); umol 
(micromoles): nmol (nanomoles): mu or m.u. (map unit); g (gravity); gm (grams): mg 
(milligrams): ng (micrograms): pg (picograms); L (liters): ml (milliliters); ul (microliters): cm 
(centimeters): mm (millimeters); urn (micrometers): nm (nanometers): hr (hour): min 
(minute,: msec (millisecond): °C (degrees Centigrade); AMP (adenosine 5 -monophosphate): 
cDNA (copy or complimentary DNA): DTT (dithiotheritol); ddH : 0 (double distilled water); 
dNTP (deoxyribonucleotide triphosphate); rNTP (ribonucleotide triphosphate); ddNTP 
(dideoxyribonucleo.ide triphosphate): bp (base pair): kb (kilo base pair): TLC (thin layer 
chromatography): tRNA (transfer RNA); nt (nucleotide): VRC (vanadyl ribonucleoside 
complex): RNase (ribonuclease); DNase (deoxyribonuclease); poly A (polyriboadenylic acid); 
PBS (phosphate buffered saline); OD (optical density): HEPES (N-[2- 
Hydroxyethyl]p iper azine-N-[2-ethanesulfonic acid]): HBS (HEPES buffered saline); SDS 
(sodium dodecyl sulfate); Tris-HCl (tri S [Hydroxymethyl]aminomethane-hydrochlorid e ); rpm 
(revolutions per minute): ligation buffer (50 mM Tris-HCl. 10 mM MgCl,. 10 mM 
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diihiothreitol. 25 j.ig/ml bovine serum albumin, and 26 uM NAD-*-, and pH 7.8): EGTA 
(ethylene gIycoI-bis(p-aminoethyl ether) N. N. N\ N'-tetraacetic acid): EDTA 
(ethylenediaminetetracetic acid): ELISA (enzyme linked immunosorbam assay): LB (Luria- 
Bertani broth: 10 g tryptone. 5 g yeast extract, and 10 g NaCI per liter. pH adjusted to 7.5 
5 with IN NaOH); superbroth (12 g tryptone. 24 g yeast extract. 5 g glycerol. 3.8 g KH : PO, 

and 12.5 g. K : HP0 4 per liter); DMEM (Dulbecco's modified Eagle's medium): ABI (Applied 
Biosystems Inc.. Foster City. CA): Amersham (Amersham Corporation. Arlington Heights. 
IL): ATCC (American Type Culture Collection. Rockville, MY): Beckman (Beckman 
Instruments Inc.. Fullerton CA): BM (Boehringer Mannheim Biochemicals. Indianapolis. IN); 

10 Bio- 101 (Bio- 101. Vista. CA): BioRad (BioRad. Richmond. CA); Brinkmann (Brinkmann 

Instruments Inc. Wesbury. NY); BRL. Gibco BRL and Life Technologies (Bethesda Research 
Laboratories. Life Technologies Inc.. Gaithersburg, MD); CRI (Collaborative Research Inc. 
Bedford. MA): Eastman Kodak (Eastman Kodak Co.. Rochester. NY): Eppendorf (Eppendorf. 
Eppendorf North America. Inc.. Madison. WI); Falcon (Becton Dickenson Labware. Lincoln 

15 Park. NJ): IBI (International Biotechnologies. Inc., New Haven. CT): ICN (1CN Biomedicals. 
Inc.. Costa Mesa. CA); Invitrogen (Invitrogen. San Diego. CA); New Brunswick (New 
Brunswick Scientific Co. Inc. Edison. NJ): NEB (New England BioLabs Inc.. Beverly. MA): 
NEN (Du Pont NEN Products. Boston. MA); Pharmacia (Pharmacia LKB Gaithersburg, MD): 
Promega (Promega Corporation. Madison. Wl): Stratagene (Siratagene Cloning Systems. La 

20 Jolla. CA): UVP (UVP. Inc.. San Gabreil. CA); USB (United States Biochemical Corp.. 
Cleveland. OH); and Whatman (Whatman Lab. Products Inc. Clifton. NJ). 

Unless otherwise indicated, all restriction enzymes and DNA modifying enzymes were 
obtained from New England Biolabs (NEB) and used according to the manufacturers 
directions. 

25 

EXAMPLE 1 

Generation Of Packaging Cell Lines That Coexpress 
The Adenovirus El And DNA Polymerase Proteins 

30 In this example, packaging cell lines coexpressing Ad El and polymerase proteins 

were described. These cell lines were shown to support the replication and growth of H5ts36, 
an Ad with a temperature-sensitive mutation of the Ad polymerase protein. These 
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polymerase-expressing packaging cell lines can be used to prepare Ad vectors deleted for the 
El and polymerase functions. 



10 



a) Tissue Culture And Virus Growth 

LP-293 cells (Microbix Biosystems. Toronto) were grown and serially passaged as 
suggested by the supplier. 

Plaque assays were performed in 60 mm dishes containing cell monolayers at ~ 90% 
coniluency. The appropriate virus dilution in a 2% DM EM solution was dripped onto the 
cells, and the plates incubated at the appropriate temperature for one hour. The virus 
containing media was aspirated, the monolayer was overlaid with 10 mis of a pre-warmed 
EMEM agar overlay solution (0.8% Noble agar. 4% fetal calf serum, and antibiotics) and 
allowed to solidify. After the appropriate incubation time (usually 7 days for incubations at 
38.5°C and 10-12 days for incubations at 32 D C). five mis of the agar-containing solution 
containing 1.3% neutral red was overlaid onto the infected dishes and plaques were counted 
15 the next day. An aliquot of the virus H5ts36 [Freimuth and Ginsberg (1986) Proc. Natl. 

Acad. Sci. USA 83:7816] was utilized to produce high litre stocks after infection of 293 cells 
at 32°C. H5ts36 is an Ad5-derived virus defective for viral replication at the nonpermissive 
temperature [Miller and Williams (1987) J. Virol. 61:3630]. 

The infected cells were harvested after the onset of extensive cytopatliic effect, 
20 pelleted by centrifugation and resuspended in 10 mM Tris-Ci. pH 8.0. The lysate was freeze- 
thawed three times and centrifuged to remove the cell debris. The cleared lysate was applied 
to CsCI : step gradients (heavy CsCl at density of 1.45 g/ml. the light CsCl at density of 1.20 
g/ml). uhracentrifuged. and purified using standard techniques [Graham and Prevec (1991) In 
Methods in Molecular Biology, Vol 7: Gene Transfer and Expression Protocols. Murray 
(ed.). Humana Press. Clifton. NJ, pp. 109-128]. The concentration of plaque forming units 
(pfu) of this stock was determined at 32°C as described above. Virion DNA was extracted 
from the high litre stock by pronase digestion, phenol-chloroform extraction, and ethanol 
precipitation. The leakiness of this stock was found to be <1 in 2000 pfu at the non- 
permissive temperature, consistent with previous reports [Miller and Williams (1987). supra). 



30 



b) Plasmids 

The expression plasmid pRSV-pol [Zhao and Padmanabhan (1988) Cell 55:1005] 
contains sequences encoding the Ad2 polymerase mRNA (including the start codon from the 
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exon at map unit 39) under the transcriptional control of the Rous Sarcoma Virus 
LTR/'promoter element: the Ad2 DNA polymerase sequences are Hanked on the 3* end by the 
SV-40 small t intron and SV-40 polyadenylation addition site (see Fig. 1A). Figure 1A 
provides a schematic representation of the Ad polymerase expression plasmid pRSV-poI. 
pRSV-pol includes the initiator methionine and amino-termina! peptides encoded bv the exon 
at m.u. 39 of the Ad genome. The location of the PCR primers p602a and p2158c. the two 
Seal 1 kb probes utilized for Northern analyses, the polymerase terminator codon. and the 
polyadenylation site of the IVa2 gene (at 11.2 m.u. of the Ad genome) are indicated. 

The expression plasmid pRSV-pTP [Zhao and Padmanabhan (1988). supra] contains 
sequences encoding the Ad2 preterminal protein (including the amino terminal peptides 
encoded by the exon at map unit 39 of the Ad genome) under the transcriptional control of 
the Rous Sarcoma Virus LTR/promoter element; the pTP sequences are flanked on the 3' end 
by the SV-40 small-t intron and SV-40 polyadenylation signals (see Figure IB for a schematic 
of pRSV-PTP). Figure IB also shows the location of the £cv;RV subfragment utilized as a 
probe in the genomic DNA and cellular RNA evaluations, as well as the initiator methionine 
codon present from map unit 39 in the Ad5 genome. The locations of the H5/nl90 and 
HS.vwAlOO insertions are shown relative to the preterminal protein open-reading frame. The 
following abbreviations are used in Figure 1: ORF. open reading frame: small-t. small tumor 
antigen, and m.u.. map unit. 

pCEP4 is a plasmid containing a hygromycin expression cassette (Invitrogen). 

pFGUO (Microbix Biosystems Inc. Toronto. Ontario) is a plasmid containing 
sequences derived from Ad5tf/309 which contain a deletion/substitution in the E3 region 
[Jones and Shek (1979) Cell 17:683]. pFG140 is infectious in single transfections of 293 
cells and is used as a control for transfection efficiency. 

c) Transfection Of 293 Cells 

LP-293 cells were cotransfected with BamHl linearized pRSV-pol and BamH] 
linearized pCEP4 at a molar ratio of 10:1 using a standard CaP0 4 precipitation method 
[Sambrook et ai (1989) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor 
Press. Plainview. NY. pp. 16.33- 16.36]. In addition. 293 cells were cotransfected with or with 
BamHl linearized pRSV-pol. BamHl linearized pRSV-pTP and pCEP using a molar ratio of 
10:1 (non-selectable:selectable plasmids). 
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Forty-eight hours after transaction, the cells were passaged into media containing 
hygromycin at 100 ug/mL. Individual hygromycin resistant colon.es were isolated and ' 
expanded. 



d) Isolation Of Ad Polymerase Expressing 293 Cell Lines 

Twenty hygromycin resistant cell lines were expanded and screened for the ability to 
express the Ad polymerase protein. Initially, the individual cell lines were assayed for the 
ability 10 support growth of the viral polymerase mutant H5ts36 using the plaque assay 
described in section a) above. It was speculated that constitutive expression of the wild type 
Ad polymerase protein in a clonal population of 293 cells should allow the growth of H5ts36 
at 38.5-C. However, it was unclear if constitutive e.xpress.on of the Ad polymerase would be 
toxic when coexpressed with the El proteins present in 293 cells. Similar toxicity problems 
have been observed with the Ad ssDBP. and the pTP [Klessig a, al. (1984) Mol. Cell. Biol. 
4:1354 and Schaack ei al. (1995) J. Virol. 69:4079]. 

Of the twenty hygromycin resistant cell lines isolated, seven were able to support 
plaque formation with H5ts36 at the non-permissive temperature, unlike the parental LP-293 
cells: these cell lines were named B-6. B-9. C-l. C-4. C-7. C-13 and C-I4 (see Table 1) (the 
B-6 and B-9 cell lines received only the pRSV-pol and CEP4 plasmids: C-l. C-4. C-7. C-13 
and C-14 cell lines received the pRSV-pol. pRSV-pTP and CEP4 plasmids). For the results 
shown in Table 1. dishes (60 mm) of near confluent cells of each cell line were infected with 
the same dilution of H5ts36 at the temperature indicated, overlaid with agar media, and 
stained for plaques as outlined in section a. Passage number refers to the number of serial 
passages after initial transfection with the plasmid pRSV-pol. 

The cell line B-6 produced plaques one day earlier than the other Ad polymerase- 
expressing cell lines, which may reflect increased polymerase expression (see below). Cell 
line B-9 demonstrated an increased doubling time whereas each of the other cell lines 
displayed no growth disadvantages relative to the parental LP-293 cells. As shown in Table 
1- even after multiple passages (in some instances up to four months of serial passaging) the 
ceils were still capable of H5ts36 plaque formation at the non-permissive temperature, 
indicating that the RSV-LTR/promoter remained active for extended periods of time. 
However, the cell lines B-9 and C-13 displayed a decreased ability to plaque the virus at 32°C 
as well as at 38.5°C. suggesting that a global viral complementation defect had occurred in 
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these cell lines after extended passaging. The remaining cell lines screened at later passaces 
demonstrated no such defect, even after 20 passages (e.g.. cell line B-6. Table 1). 

TABLE 1 



Plaquins Ability Of H5ts36 Ai The Non-Permissive Temperature Utilizing 293-Ad Polymerase Expressii 

Cell Lines 



10 



Cell Line 


Passage Number 


Number Of Plaques At: 


32.0°C 


38.5°C 


293 


■ 


>500 


0 


B-6 


9 


>500 


>500 




20 


>500 


>500 


B-9 


5 


>500 


>500 




14 


90 


18 




5 


>500 


>500 


C-l 


13 


>500 


>500 


C-4 


5 


>500 


>500 


14 


>500 


>500 




5 


>500 


>500 


C-7 


14 


>500 


>500 




5 


>500 


>500 


C-13 


27 


120 


4 




5 


>500 


>500 


C-M 


27 


>500 


370 



e) Genomic Analysis Of Ad Polymerase-Expressing Cell Lines 

Genomic DNA from LP-293 cells and each of the seven cell lines able to complement 

H5ts36 at 38.5°C were analyzed by PCR for the presence of pRSV-pol derived sequences. 

Genomic DNA from LP-293 cells and the hygromycin resistant cell lines were harvested 
20 using standard protocols (Sambrook et al., supra) and 200 ng of DNA from each cell line was 

analyzed by PCR in a solution containing 2 ng/mL of primers p602a and p2158c. 10 mM 

TrisHCl. pH 8.3. 50 mM KC1. 1.5 mM MgCL and 0.001% gelatin. The forward primer. 

p602a [ 5 * - TTC A TTTT ATG TTTC A G GTTC AGGG-3* (SEQ ID NO:2)] is located in the SV- 

40 poiyadenylation sequence. The reverse primer p2158c [5"- 
25 TTACCGCCACACTCGCAGGG-3 , (SEQ ID NO:3)] is Ad-sequence specific with the 5' 

nucleotide located at position 3394 of the Ad 5 genome [numbering according to Doerfler 
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(1986, Mixovirus D\ : A. The Viral Genome and l,s Expression. Nijhoff. Boston. MA pp ,. 
95]. ' W ' 

PCR was performed with a Perkin Elmer 9600 Thcrmocycler utilizma the following 
cycling parameters: initial denaturation at 94°C for 3 mm. 3 cycles of denaturauon at 94°C 
for 30 sec. annealing at 50°C for 30 sec. and extens.on at 72°C for 60 sec. followed bv 
another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
72°C for 10 minutes. PCR products were separated on a 1.0 % agarose gel and visualized 
with ethidium bromide staining (Figure 2). A I kb ladder (Gibco-BRI) was used as a size 
marker, and the plasmid pRSV-pol was used as a positive control. The ^750 bp PCR products 
are indicated by an arrow in Figure 2. 

As shown in Figure 2, all cell lines capable of H5ts36 plaque formation at 38.5°C 
contained the Ad pol DNA sequences, whereas the LP-293 cells did not vield anv 
amplification product with these primers. This result demonstrates that each of the selected 
cell lines stably co-integrated not only the hygromycin resistance plasmid pCEP4 but also 
pRSV-pol. 



0 Complementation Of The Replication Defect Of H5ts36 By 
Ad Polymerase Expressing Cell Lines 

The C to T transition at position 7623 of the H5ts36 genome alters the DNA binding 
affinity of the Ad polymerase protein, rendering it defective for viral replication at non- 
permissive temperatures [Chen et al. (1994) Virology 205:364: Miller and Williams (1987) J 
Virol. 61:3630: and Wilkie e , al. (1973) V.rology 51:499], To analyze the funct.onal activiiv 
of the Ad polymerase protein expressed by each of the packaging cell lines, a viral 
replication-complementation assay was performed. LP-293 cells or the hvgromvein resistant 
cell lines were seeded onto 60 mM dishes at densit.es of 2.5-3.0 x 10'per dish, infected with 
H5ts36 at a multiplicity of infection (MOI) of 10. and incubated for 24 hours at 38.5°C. or 48 
hours at 32°C. Total DNA was harvested from each plate, then 2 ug of each sample were 
digested with HindlU. separated on a 1.0% agarose gel. transferred to a nylon membrane, and 
hybridized with S -P.|abeled H5«s36 virion DNA. Densitometry analysis of the 8.010 bp 
////7dlll fragment in each lane was performed on a phophoroimager (Molecular Dynamics) 
utilizing a gel image processing system (IP Lab Version 1.5. Sunnyvale. CA). 
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The resulting autoradiograph is shown in Figure 3. In Figure 3. the lane marked 
"Std." (standard) contains 1 fig of ///nd 1 1 1 -d i gesied H5ts36 virion DNA. The 8.010 bp 
Hindlll fragments analyzed by densitometry are indicated by an arrow. 

As shown in Figure 3. H5ts36 had a diminished ability to replicate in LP-293 cells at 
5 the non-permissive temperature. In contrast, all seven of the previously selected cell lines 

were able to support replication of H5ts36 virion DNA at 38.5°C to levels approaching those 
occurring in LP-293 cells at 32°C 

A densitometric analysis of the amount of H5ts36 viral DNA replicated in each of the 
cell lines at permissive and nonpermissive temperatures is presented in Table 2 below. For 
10 this assay, the relative amounts of the 8.010 bp ///mil 1 1 fragment were compared. The 

relative levels of H5ts36 virion DNA replication determined by densitometric analysis of the 
8.010 bp HindU] fragment isolated from each of the cell line DNA samples. The surface area 
of the 8.010 bp fragment in 293 cells incubated at 38.5°C was designated as 1. and includes 
some replicated H5ts36 virion DNA. The numbers in each column represent the ratio 
15 between the densities of the 8.010 bp fragment isolated in the indicated cell line and the 
density of the same band present in LP-293 cells at 38.5°C. 

As shown in Table 2. the levels of replication at the permissive temperature were all 
within four-fold of each other, regardless of which cell line was analyzed, but at the non- 
permissive temperature LP-293 cells reveal the H5ts36 replication defect. The viral bands 
20 that were present in the LP-293 DNA sample at 38. C represented input virion DNA as well as 
low level replication of H5ts36 DNA. which is generated due to the leakiness of the ts 
mutation at the high MOl utilized in this experiment. The Ad polymerase-expressing cell 
lines were all found to be capable of augmenting 
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TABLE 2 

Densitometry Analysis Of H5is36 Replication 



Cell Line 



Ralios Of 8.010 bp w„dl|| Fragment Generated At: 




H,ts36 genome replication. Although one cell line (B-9) allowed H5ts36 replicat.on to levels 
only ,6 fold greater than LP-293 cell, this was the same cel. line that was observed to 
d-splav poor growth properttes. Each of the remaining Ad polymerase-ex pressing cell l.nes 
allowed substantially greater replication of H5ts36 at non-permissive temperatures, compared 
•o LP-293 cells (Table 2). An enhancement of replication up to 75 fold above that of LP-?93 
cells was observed with the cel. line C-7 at 38.5°C. A substantially more rapid onset of viral 
CVtopathic effect in Ad polymerase-expressmg cel. lines was observed at either temperature 
These estimates of H5ts36 repHcation-complementatton are conservative, since thev have not 
been adjusted for the low level replication of H5ts36 at 38.5°C [Miller and Willi™ (,987) 
supra]. The leakiness of the H5ts36 mutation could potentially be overcome with the use of a 
virus deleted for the Ad polymerase gene. 



g) RNA Analysis Of Ad Polymerase-Exprcssing Cell Lines 

Total RNA was extracted from each of the cell lines using the RNAzol method 
(Teltest. Inc.. Friendswood. TX 77546: Chomczynski and Sacchi (1987) Anal. Biochem 
162:1,6]. Fifteen micrograms of RNA from each cell line was electrophoresed on a 0 S% 
agarose-iormaldehyde gel. and transferred to a Nytran membrane (Schleicher & Schuel.) bv 
blotung. The filter was UV crosslinked. and analyzed by probing with the two labeled 1 
kb SuA subiragments of Ad which span positions 6095-8105 of the Ad5 genome ,see Figure 
1A). These two Seal subfragments of the Ad genome are complimentary to the 5' end of the 
Ad polymerase mRNA. The resulting autoradiograph is shown in Figure 4. ,„ Fi,ure 4 the 
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location of ihe smaller and larger species of Ad polymerase mRNA are indicated relative to 
the 28S and 18S ribosomal RNAs. The aberrant transcript expressed by the B-9 cell line is 
indicated by an arrow. 

The results shown in Figure 4 revealed that the RNA derived from cell lines B-6 and 
5 C-7 contained two species of RNA. estimated to be -4800 and -7000 nt in length, while LP-293 
derived RNA had no detectable hybridization signal. The presence of two polymerase RNA 
species suggests that the polyadenylation signal of the Ad IVa2 gene (present in the Ad-pol 
construct, see Fig. 1 A) is being utilized by the cell RNA processing machinery, in addition to 
the SV-40 polyadenylation signal. Similar analysis of RNA derived from the cell lines C-l. 

10 C-4. C-l 3. and C-l 4 also detected the same two transcripts as those detected in the RNA of 
cell lines B-6 and C-7. but at decreased levels, suggesting that even low levels of Ad 
polymerase mRNA expression can allow for the efficient replication of polymerase mutants 
such as H5ts36. The cell line B-6 expressed high levels of polymerase transcript and can 
plaque H5ts36 one day earlier than the other cell lines at 38.5°C. suggesting a causal 

15 relationship. It is interesting to note that the two polymerase transcripts are also detected in 
RNA isolated from cell line B-9, but substantial amounts of a larger RNA transcript (size > 
10 kb) is also present (see Figure 4). The high level production of the aberrant message may 
be related to the increased doubling time previously noted in this cell line. 

20 h) Transferability Of Ad Polymerase-Expressing Cell Lines 

The ability of Ad polymerase-expressing 293 cell lines to support production of 
H5ts36 virions after iransfection with H5ts36 genome DNA was examined as follows. 293 
ceils as well as hygromycin resistant cell lines were grown to near confluency on 60 mm 
dishes and transfected with either 3 jig of purified H5ts36 virion DNA. or with 3.5 ng of the 
25 plasmid pFG140 (Microbix Biosystems), using the caiionic lipid Lipofectamine (Gibco-BRL). 
Cells that received the H5ts36 virion DNA were incubated at 32°C for 14 days, or 38.5°C for 
10 days. The pFG140 transfected cells were incubated at 37,5°C for 10 days. All plates 
were then stained with the neutral red agar overlay and plaques were counted the next day. 
The results are shown in Table 3. 
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TABLE 3 

Transfection Efficiency Of Ad pol-Exprcssing Cell Lii 




n.d. - not determined 



The results shown in Table 3 demonstrated that transfection of H5ts36 DNA at the 
non-permissive temperature allows for ample plaque production in all of the Ad polvmerase- 
expressmg cell lines tested, unlike the parental LP-293 cells. Cell line C-13 was at passage 
number 29. and demonstrated a somewhat decreased ability to generate plaques at this 
extended passage number. These same cell lines are a.so capab.e of producing plaques when 
.ra.sfec.ed with the plasmid pFGMO. a plasmid capable of producing infectious. El 
dependent Ad upon transfection of the parental 293 cells [Ghosh-Choudhury ( 1 986) Gene 
^0:1 61]. These observations suggest that the Ad polymerase expressing cell lines should be 
useru. lor the production of second generation Ad vectors deleted not only lor the El genes 
but also for the polymerase gene. As shown below in Examples 2 and 3, this is indeed the 
case. 



EXAMPLE 2 

Isolation And Characterization Of Packaging Cell Lines That 
Coexpress The Adenovirus El. DNA Polymerase And Preterminal Proteins 

In Example 1. packaging cell lines coexpressing Ad EI and polymerase proteins were 
descnbed. These cell lines were shown to support the replication and growth of H5ts36. an 
Ad w,th a temperature-sensitive mutation of the Ad polymerase protein. These polymerase- 
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expressing packaging cell lines can be used to prepare Ad vectors deleted for the El and 
polymerase functions. In this example. 293 ceils cotransfected with both Ad polymerase and 
preterminal protein expression plasmids are characterized. Cell lines co-expressing the Ad El. 
polymerase and preterminal proteins can be used to prepare Ad vectors deleted for the El. 
polymerase and preterminal protein (pTP) functions. 

a) Tissue Culture And Virus Propagation 

The use of LP-293 cells (Microbix Biosystems inc.. Toronto). Ad-polymerase 
expressing cell lines, and plaquing efficiency assays of Ad viruses was conducted as described 
in Example 1. All cells were maintained in 10% fetal bovine serum supplemented DMEM 
media fGIBCO) in the presence of antibiotics. The virus \i5sub\00 [Freimuth and Ginsberg 
(1986) Proc. Natl. Acad. Sci. USA 83:7816] has a temperature sensitive (ts) mutation caused 
by a three base pair insertion within the amino terminus of the preterminal protein, in addition 
to a deletion of the El sequences (see Figure IB). U5sub\00 was propagated and titred at 
32.0°C in LP-293 ceils: the leakiness of this stock was less than 1 per 1000 plaque-forming 
units (pfu) at the nonpermissive temperature of 38.5°C. A lower titer cell lysate contaning 
the virus H5//7190 (which contains a 12 base-pair insertion within the carboxy-terminus of the 
preterminal protein as well as a deletion of the El region, see Figure IB) was provided by Dr. 
P. Freimuth [Freimuth and Ginsberg (1986), supra]. The polymerase and preterminal protein 
expressing cell lines were always maintained in media supplemented with hygromycin 
(Sigma) at 100 ng/mL. 

b) Isolation Of Ad Polymerase And Preterminal Protein 
Expressing 293 Cells 

The C-l. C-4, C-7. C-13. and C-14 cell lines (Ex. 1). which had been cotransfected 
with pRSV-pol. pRSV-pTP and CEP4. were screened for presence of pTP sequences and for 
the ability to support the growth of H5ts36 (ts for the Ad-polymerase), H5/wl90. and 
H5sitb\00 using plaque assays as described in Example 1. 
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i) Analysis Of Genomic DNA And Cellular RNA 

Cell lines that had received the preterminal protein expression plasmid were screen for 
the presence of pRSV-pTP sequences and El sequences. Total DNA was isolated from the 
LP-293. B-6 (transfected with pRSV-pol only) or C-7 (cotransfected with pRSV-pol and 
pRSV-pTP) cell lines, two micrograms lug) of each DNA was codigested with the restriction 
enzymes Xbal and Bo/wHI. electrophoretically separated in a 0.6% agarose gel. and 
transferred onto a nylon membrane. The membrane was UV crosslinked. probed with both a 
1.8 kb BM-Xbal fragment (spans the El coding region) isolated from the plasmid pFG140. 
and a 1.8kb EcoRV ^fragment of Ad serotype 5 (spans the preterminal protein coding 
sequences: see Fig. IB), both of which were random-primer radiolabeled with K P to a specific 
activity greater than 3.0 x 10 8 cpm/ug. The membrane was subsequently exposed to X-ray 
film with enhancement by a fluorescent screen. The resulting autoradiograph is shown in 
Figure 5. 

In Figure 5. the preterminal specific sequences migrated as an ~1 1.0 kb DNA fragment 
while the El containing band migrated as a 2.3 kb DNA fragment. No hybridization of either 
probe to DNA isolated from either the LP-293 or B-6 cell lines was observed As shown in 
Figure 5. only the C-7 cell line genomic DNA had preterminal coding sequences, unlike the 
parental LP-293 cells, or the Ad-polymerase expressing B-6 cells. In addition, all cell lines 
had El specific sequences present at nearly equivalent amounts, demonstrating that the 
selection design has not caused the loss of the El sequences originally present in the LP-293 
cells. The results presented in Example 1 demonstrated that both the B-6 and C-7 cell lines 
contain polymerase specific sequences within their genomes, unlike the parental LP-293 cells. 

To confirm that transcription of preterminal protein was occurring, total RNA was 
isolated from each of the cell lines, transferred to nylon membranes, and probed to detect 
preterminal protein-specific mRNA transcripts as follows. Total cellular RNA was isolated 
from the respective cell lines and 15 ug of total RNA from each cell line was transferred to 
nylon membranes. The membranes were probed with the 1 .8 kb £coRV radiolabeled 
subfragment of Ad5 (see Fig. IB) complementary to the preterminal protein coding region. 
The resulting autoradiograph is shown in Figure 6. 

As shown in Figure 6. a single mRNA of the expected size (~3 kb in length) is detected 
only in RNA derived from the C-7 cell line. No hybridization was detected in lanes 
containing RNA derived from the LP-293 or B-6 cell lines. In Example 1. it was 
demonstrated that the C-7 cell line also expresses high levels of the Ad polymerase mRNA. 
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Thus, ihe C-7 cell line consiitutively expresses both the Ad polymerase and preterminal 
protein mRNAs along with E 1 transcripts. 



10 



15 



20 



ii) Plaquing Efficiency Of pTP Mutants On pTP- 
Exprcssing Cell Lines 

The C-7 cell line was screened for the ability to transcomplement the growth of 
preterminal mutant viruses. The virus H5/>?190 (contains a 12 base pair insertion located 
within the carboxy-terminus of the preterminal protein) has been shown to have a severe 
growth and replication defect, producing less than 10 plaque-forming units per cell [Freimuth 
and Ginsberg (1986) Proc. Natl. Acad. Sci. USA 83:7816]. The results are summarized in 
Table 4 below. For the results shown in Table 4, LP-293. B-6. or C-7 cells were seeded at a 
density of 2.0-2.5 x 10 6 cells per plate. The cells were infected with limiting dilutions of 
lysates derived from the preterminal protein-mutant viruses H5//?I90 or H5sub\00. incubated 
at 38.5°C. and plaques counted after six days. As shown in Table 4. only the C-7 cell line 
could allow efficient plaque formation of H5/nl90 at 38.5°C (the H5/wl90 lysate used to 
infect the cells was of a low titer, relative to the high titer H5.yu6100 stock), while both the B- 
6 and C-7 cell lines had nearly equivalent plaquing efficiencies when H5.vn&100 was utilized 
as the infecting virus. 

TABLE 4 



Virus 


Mutation 
Location 


Plaque Titres (pfu/ml) 


LP-293 


B-6 


C-7 


H5ml90 


carboxy-terminus 


<1 x 10 : 


<1 x 10 : 


1.4 x |(F 


H5sttb\00 


amino-terminus 


<i x icr 


9.0 x 10" 


4.5 x I0 h 



30 



As shown in Table 4. when equivalent dilutions of H5/wl90 were utilized, the plaquing 
efficiency of the C-7 cell line was at least 100- fold greater than that of the B-6 or LP-293 
cells. This result demonstrated that the C-7 cell line produces a functional preterminal 
protein, capable of trans-complementing the defect of the H5/7?190 derived preterminal 
protein. 

The cell lines were next screened for the ability to trans-complement with the 
temperature-sensitive virus. H5™6100. at nonpermissive temperatures. H5sub\0Q has a codon 
insertion mutation within ihe amino-terminus of the preterminal protein, as well as an El 
deletion. The mutation is responsible both for a temperature sensitive growth defect, as well 



- 34 - 



20 



30 



WO 98/17783 PCT/US97/19541 

as a replication defect [Freimuih and Ginsberg ( 1986). supra and Schaack ei al. (1995) j. 
Virol. 69:4079], The plaquing efficiency of the cell line C-7 was found to be at least 1000 
fold greater than that of the LP-293 cells (at non-permissive temperatures) (see Table 4). 
Interestingly, the cell line B-6 was also capable of producing large numbers of USsublOO 
derived plaques at 38.5T. even though it does not express any preterminal protein. This 
result suggested that the high level expression of the polymerase protein was allowing plaque 
formation of HSsublOO in the B-6 cell line. To examine this possibility, the nature of 
H5.vuA100 growth in the various cell lines was examined. 

c) Complementation Of The Replication And Growth Defects 
Of HSsubiOO 

The cell lines B-6 and C-7 were shown to overcome the replication defect of H5ts36 
(Ex. 1). Since the preterminal and polymerase proteins are known to physically interact with 
each other [Zhao and Padmanabhan (1988) Cell 55:1005). we investigated whether the 
expression of the Ad-polymerase could overcome the replication defect of HSsublOO at non- 
permissive temperatures using the following replication-complememation assay. 

LP-293. B-6. or C-7 cells were seeded onto 60 mM dishes at a density of 2x10* cells 
per dish and infected the next day with H5subl00 at a multiplicity of infection (MOl) of 0.25, 
and incubated at 38.5°C for 16 hours or 32.0°C for 40 hours. The cells from each infected 
plate were then harvested and total DNA extracted as described in Example 1. Four 
micrograms of each DNA sample was digested with HindlU. electrophoresed through a 0.7% 
agarose gei. transferred to a nylon membrane, and probed with :o P-labeled H5ts36 virion 
DNA. The resulting autoradiograph is shown in Figure 7. As seen in Figure 7. the 
H5^100 replication defect when grown in LP-293 cells at 38.5°C is seen; this defect is not 
present when the virus is grown at the same temperature in either B-6 or C-7 cells. 

The results depicted in Figure 7 demonstrates that both cell lines B-6 and C-7 could 
trans-complement the replication defect of H5sub\00. This result demonstrated that the 
expression of the Ad polymerase in B-6 cells was able to overcome the preterminal protein- 
mediated replication defect of H5sub\00. While not limiting the present invention to any 
particular mechanism, the ability of Ad polymerase to overcome the preterminal protein- 
mediated replication defect of HSsublQO may be due to a direct physical interaction of the 
polymerase with the amino-terminus of the H5^100-derived preterminal protein. 
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In support of this hypothesis, it has also been demonstrated that the H5m/>100 
replication defect can be overcome when LP-293 cells were infected with a 100-fold greater 
amount of H5aw6100. However, complementation of the HS^/ftlOO replication defect is not 
sufficient to overcome the growth defect of USsublOO. since temperature shift-up experiments 
have demonstrated that the H5.S7/6100 growth defect is not dependent upon viral replication 
[Schaack et al (1995). supra]. Therefore, the overexpression of the Ad-polymerase must 
have allowed a very low level but detectable production of infectious H5.y«A100 particles in 
the B-6 cell line. The reduced growth of H5jh/6100 is therefore not due to a replication 
defect, but rather some other critical activity that the preterminal protein has a role in. such as 
augmention of viral transcription by association with the nuclear matrix [Schaack and Shenk 
(1989) Curr. Top. Microbiol. Immunol. 144:185 and Hauser and Chamberlain (1996) J. Endo. 
149:373]. This was confirmed by assessing the ability of the C-7 cell line to overcome the 
growth defect of U5sub\00 utilizing one-step growth assays performed as follows. 

Each of the cell lines (LP-293. B-6 and C-7) were seeded onto 60 mm dishes at 2.0 x 
10* cells/dish. The cell lines were infected at an MOl of 4 with each of the appropriate 
viruses (\vtAd5. H5ts36, or H5jwA100), and incubated at 38.5°C for 40 hours. The total 
amount of infectious virions produced in each 60 mm dish was released from the cell lysates 
by three cycles of freeze-thawing, and the titer was then determined by limiting dilution and 
plaque assay on B-6 cells at 38.5°C. The results are summarized in Figure 8. 

The results shown in Figure 8. demonstrated that even though the B-6 cell line allowed 
normal replication and plaque formation of H5.«v6100 at 38.5°C (in fact. B-6 cells were 
utilized to determine the plaque titres depicted in Figure 8) they could not allow high level 
growth of H5a-w6100 and only produced titres of H5sub\00 equivalent to that produced by the 
LP-293 cells. The C-7 cell line produced 100 fold more virus than the LP-293 or B-6 cells, 
see Figure 8. Encouragingly, the titre of H5.vw/?100 produced by the C-7 cells approached 
titres produced by LP-293 cells infected with wild-type virus. Ad5. When the HSsublOO 
virions produced from infection of the C-7 cells were used to infect LP-293 cells at 38.5°C. 
all virus produced retained the is mutation {i.e.. at least a 1000 fold drop in pfu was detected 
when LP-293 cells were respectively infected at 38.5°C vs. 32.0°C). This finding effectively 
rules out the theoretical possibility that the H5sivZ>100 input virus genomes recombined with 
the preterminal protein sequences present in the C-7 cells. 

In addition, the C-7 cell line allowed the high level growth of H5ts36. demonstrating 
that adequate amounts of the Ad-polymerase protein were also being expressed. The C-7 cell 
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line was capable of trans-complementing the growth of both H5ts36 and H5sub\00 after 4 
months of serial passaging, demonstrating that the coexpression of the El. preterminal, and 
polymerase proteins was not toxic. 

These results demonstrate that the constitutive expression of both the polymerase and 
preterminal proteins is not detrimental to normal virus production, which might have occurred 
if one or both of the proteins had to be expressed only during a narrow time period during the 
Ad life cycle, In summary, these results demonstrated that the C-7 cell line can be used as a 
packaging cell line to allow the high level growth of El. preterminal, and polymerase deleted 
Ad vectors. 

EXAMPLE 3 

Production Of Adenovirus Vectors 
Deleted For E I And Polymerase Functions 

In order to produce an Ad vectors deleted for El and polymerase functions, a small, 
frame-shifting deletion was introduced into the Ad-pol gene contained within an El -deleted 
Ad genome. The plasmid pBHGl 1 (Microbix) was used as the source of an El -deleted Ad 
genome. pBHGll contains a deletion of Ad5 sequences from bp 188 to bp 1339 (0.5-3.7 
m.u.): this deletion removes the packaging signals as well as El sequences. pBHGll also 
contains a large deletion within the E3 region (bp 27865 to bp 30995: 77.5-86.2 m.u.). The 
nucleotide sequence of pBHGl 1 is listed in SEQ ID NO:4 [for cross-corrleation between the 
pBHGl 1 sequence and the Ad5 genome (SEQ ID NO:l), it is noted that nucleotide 8,773 in 
pBHGl 1 is equivalent to nucleotide 7.269 in Ad5]. 

pBHGl 1 was chosen to provide the Ad backbone because this plasmid contains a large 
deletion within the E3 region (77.5 to 86.2 m.u.) and therefore vectors derived from this 
plasmid permit the insertion of large pieces of foreign DNA. A large cloning capacity is 
important when the poP vectors is to be used to transfer a large gene such as the dystrophin 
gene (cDNA = 13.6 kb). However, the majority of genes are not this large and therefore 
other Ad backbones containing smaller deletions within the E3 region (e.g.. pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.: Microbix) may be employed for the 
construction of pol" vectors using the strategy outlined below. 
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a) Construction Of A Plasmid Containing A Portion Of The 
Adenovirus Genome Containing The Polymerase Gene 

A fragment of the Ad genome containing the pol gene located on pBHGl 1 was 
subcloned to create pBSA-XB. Due to the large size of the Ad genome, this intermediate 
5 plasmid was constructed to facilitate the introduction of a deletion within the pol gene the pol 
deletion. pBSA-XB was constructed as follows. The polylinker region of pBluescript 
(Slratagene) was modified to include additional restriction enzyme recognition sites (the 
sequence of the modified polylinker is provided in SEQ ID NO:5: the remainder of 
pBluescript was not altered); the resulting plasmid was termed pBSX. pBHGl 1 was digested 
10 with Xba\ and BamW\ and the 20.223 kb fragment containing the pol and pTP coding regions 
(E2b region) was inserted into pBSX digested with Xba\ and BamU\ to generate pBSA-XB. 

b) Construction Of pBHGl lApol 

A deletion was introduced into the pol coding region contained within pBSA-XB in 
15 such a manner that other key viral elements were not disturbed (e.g., the major late promoter, 
the tripartite leader sequences, the pTP gene and other leader sequences critical for normal 
virus viability). The deletion of the pol sequences was carried out as follows. pBSA-XB was 
digested with BspE] and the ends were filled in using T4 DNA polymerase. The BspEV 
digested. T4 polymerase filled DNA was then digested with BamH\ and the 8809 bp 
20 BamH\/BspE](f\\\ed) fragment was isolated as follows. The treated DNA was run on a 0.6% 
agarose gel (TAE buffer) and the 8809 bp fragment was excised from the gel and purified 
using a QIAEX Gel Extraction Kit according to the manufacturer's instructions (OIAGEN. 
Chatsworth. CA). 

A second aliquot of pBSA-XB DNA was digested with BspHl and the ends were filled 
25 in with T4 DNA polymerase. The S^pHI-digested. T4 polymerase filled DNA was then 
digested with BamH\ and the 13.679 bp ^awHI/^HUfilled) fragment was isolated as 
described above. 

The purified 8809 bp BamHl/BspEUfiUed) fragment and the purified 13.679 bp 
BamH\/B<>pH](f\\\ed) fragment were ligated to generate pApol. pApol contains a 612 bp 
30 deletion within the pol gene (bp 8772 to 9385; numbering relative to that of pBHGl 1) and 
lacks the 1 1.4 kb BamH\ fragment containing the right arm of the Ad genome found within 
* pBHGll. 
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To prov.de the nght arm of the Ad genome. pApol was digested with BamHl followed 
by treatment with calf intestinal alkaline phosphatase. pBHGl 1 was digested with BamHl and 
the 1 1.4 kb fragment was isolated and purified using a Q1AEX Gel Extraction Kit as 
described above. The purified 1 1.4 kb BamHl fragment was ligated to the 
5 toHl/phosphatased pApol to generate pBHGl lApol. Proper construction of pBHGl lApol 
was confirmed restriction digestion (HindlU). 

c) Rescue And Propagation Of Ad5Apol Virus 

The Ad genome contained within pBHGl lApol lacks the packaging signals. 
10 Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl lApol. this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 
15 pAElspl A. pAElsplB (Microbix) or pAdBgill (pAdBglll is a standard shuttle vector which 
contains 0-1 m.u. and 9-16 m.u. of the adenovirus genome). 

To rescue virus. pBHGl lApol was co-transfected with Ad5<//7001 viral DNA 
Ad5 J/7001 contains a deletion in the E3 region; the E3 deletion contained within Ad5<//7001 
is smaller than the deletion contained within the Ad genome contained within pBHGIl. It is 
not necessary that Ad5c//7001 be used to recover virus: other adenoviruses, including wild- 
type adenoviruses, may be used to rescue of virus from pBHGl lApol. 

It has been reported that the generation of recombinant Ads is more efficient if Ad 
DNA-.erminal protein complex (TPC) is employed in conjunction with a plasmid containing 
the desired deletion [Miyake ei al. (1996) Proc. Natl, Acad. Sci. USA 93:1320]. Accordingly. 
Ad5dl7001-TPC were prepared as described [Miyake ei al. (1996). supra). Briefly, purified 
Ad5<//7001 virions (purified through an isopycnic CsCl gradient centered at 1.34 g/ml) were 
lysed by the addition of an equal volume of 8 M guanidine hydrochloride. The released 
Ad5 J/7001 DNA-TPC was then purified through a buoyant density gradient of 2.8 M CsCl/4 
M guanidine hydrochloride by centrifugation for 16 hr at 55.000 rpm in a VTi65 rotor 
(Beckman). Gradient fractions containing Ad5c//7001 DNA-TPC were identified using an 
ethidium bromide spot test and then pooled, dialyzed extensively against TE buffer. BSA was 
then added to a final concentration of 0.5 mg/ml and aliquots were stored at -80°C. The 
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Ad5c//700I DNA-TPC was then digested with Seal and then gel-filtered through a Sephadex 
G-50 spin column. 

One hundred nanograms of the digested Ad5 J/7001 DNA-TPC was mixed with 5 
of pBHGl lApol and used to transfect poi-expressing 293 cells (i.e.. C-7). Approximately 10 
5 days post-transfection. plaques were picked. The recombinant viruses were plaque purified 
and propagated using standard techniques (Graham and Prevac. supra). Viral DNA was 
isolated, digested with restriction enzymes and subjected to Southern blottinn analysis to 
determine the organization of the recovered viruses. Two forms of virus containing the 612 
bp pol deletion were recovered and termed Ad5ApolAE3I and Ad5ApolAE3Il. One form of 

10 recombinant pol" virus recovered. Ad5ApolAE3I. underwent a double recombination event 
with the Ad5t//7001 sequences and contains the E3 deletion contained within Ad5t//7001 at 
the right end of the genome. The second form of recombinant pol" virus. Ad5ApolAE3II. 
retained pBHGl 1 sequences at the right end of the genome (i.e., contained the E3 deletion 
found within pBHGl 1). These results demonstrate the production of a recombinant Ad vector 

15 containing a deletion within the pol gene. 

d) Characterization Of The E1+, pol- Viruses 

To demonstrate that the deletion contained within these two poP viruses renders the 
virus incapable of producing functional polymerase. Ad5ApolAE3I was used to infect 293 

20 cells and poi-expressing 293 cells (B-6 and C-7 cell lines) and a viral replication- 
complementation assay was performed as described in Example le. Briefly. 293. B-6 and C-7 
cells were seeded onto 60 mm dishes at a density of 2 x 106 cells/dish and infected with 
H5.VZ/A100 or Ad5ApolAE31 at an MOl of 0.25. The infected cells were then incubated at 
37°C or 38.5°C for 16 hours, or at 32.0°C for 40 hours. Cells from each infected plate were 

25 then harvested and total DNA was extracted. Four micrograms of each DNA sample was 
digested with HindlU. electrophoresised through an agarose gel. transferred to a nylon 
membrane and probed with 3: P-labeled adenoviral DNA. The resulting autograph is shown in 
Figure 9. In Figure 9. each panel shows, from left to right, DNA extracted from 293. B-6 
and C-7 cells, respectively infected with either U5sub\00 or Ad5ApolAE3I (labeled 

30 AdSAPOL in Fig. 9). 

As shown in Figure 9. the recombinant pol" virus was found to be viable on pol- 
expressing 293 cells but not on 293 cells. These results demonstrates that recombinant Ad 
viruses containing the 612 bp deletion found within pApol lack the ability to express Ad 
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polymerase. These results also demonstrate that B-6 and C-7 cells efficiently complement the 
Apol found within Ad5ApolAE3I and Ad5A P olAE31I. In addinon. these results show that 
replication of the , S pTP mutant H5^100 can be complemented by high level expression of 
the Ad polymerase with or without co-expression of pTP. 

Because the expression of early genes is required for the expression of the late gene 
products, the ability of recombinant viruses contaning the Apol deletion to d.rect the 
expression of late gene products was examined. 293 and C-7 cells were infected with 
Ad5ApolAE3I and 24 hours after infection cell extracts were prepared. The cell extracts were 
senal.y diluted and examined for the expression of the fiber protein (a late gene product) bv 
.mmunoblot analysis. The immunoblot was performed as described in Example 3C with the 
exception that the primary antibody used was an anti-fiber antibody (FIBER-KNOB obtained 
from Robert Gerard. University of Texas Southwestern Medical School). The results of this 
immunoblotting analysis revealed that no fiber protein was detected from 293 cells infected 
with Ad5A P olAE3I (at any dilutuion of the cell extract). In contrast, even a 1:1000 dilution 
of cell extract prepared from C-7 cells infected with Ad5ApolAE3I produced a visible band 
on the immunoblot. Therefore, the pol deletion contained within Ad5ApolAE3I resulted in a 
greater than 1000-fold decrease in fiber production. 

The above results demonstrate that polymerase gene sequences can be deleted from the 
virus and that the resulting deleted virus will only grow on cells producing Ad-poiymen.se in 
trans. Using the pol-expressing cell lines described herein (e.g.. B-6 and C-7). large 
quantities of the pol" viruses can be prepared. A dramatic shut-down in growth and late gene 
expression is seen when cells which do not express Ad polymerase are infected with the pol" 
viruses. 



e) Generation Of El", Pol" Ad Vectors 

Ad5c//7001 used above to recuse virus containing the polymerase deletion is an El- 
containing virus. The presence of El sequences on the recombinant pol" viruses is 
undesirable when the recombinant virus is to be used to transfer genes into the tissues of 
animals: the El region encodes the transforming genes and such viruses replicate extremely 
well in vivo leading to an immune response directed against cells infected with the El- 
containing virus. 

El" viruses containing the above-described polymerase deletion are generated as 
follows. pBHGl lApol is cotransfected into pol-expressing 293 cells (e.g.. B-6 or C-7) along 
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with a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shuttle vectors include pAElsplA (Microbix). pAElsplB (Microbix) or pAdBglll. 
The gene of interest is inserted into the polylinker region of the shuttle vector and this 
piasmid is then cotransfected into B-6 or C-7 cells along with pBHGl 1 Apol to generate a 
recombinant El", pol" Ad vector containing the gene of interest. 



EXAMPLE 4 

Production Of Adenovirus Vectors 
Deleted For El And Preterminal Protein Functions 

10 

In order to produce an Ad vectors deleted for El and preterminal protein functions, a 
small deletion was introduced into the Ad preterminal protein (pTP) gene contained within an 
EI -deleted Ad genome. The piasmid pBHGl 1 f Microbix) was used as the source of an El- 
deleted Ad genome to maximize the cloning capacity of the resulting pTP~ vector. However. 
15 other Ad backbones containing smaller deletions within the E3 region {e.g.. pBHGlO which 
contains a deletion between 78.3 to 85.8 m.u.; Microbix) may be employed for the 
construction of pTP" vectors using the strategy outlined below. 

a) Construction Of pApTP 

20 A deletion was introduced into the pTP coding region contained within pBSA-XB (Ex. 

3) in such a manner that other key viral elements were not disturbed (e.g.. the tripartite leader 
sequences, the i-leader sequences, the VA-RNA I and II genes, the 55 kD gene and the pol 
gene). The deletion of the pTP sequences was carried out as follows. pBSA-XB was 
digested with Xbal and EcoRV and the 7.875 kb fragment was isolated as described (Ex. 3). 

25 Another aliquot of pBSA-XB was digested with Muni and the ends were filled in using T4 
DNA polymerase. The Mw/?I-digested, T4 polymerase filled DNA was then digested with 
Xbal and the 14.894 kb Xbal/MunK filled) fragment was isolated as described (Ex. 3). The 
7.875 kb Muni fragment and the 14.894 kb A7>al/A*uwl( filled) fragment were ligated together 
to generate pApTP. 
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b) Construction Of pBHGHApTP 

pApol contains a 429 bp deletion within the pTP gene (bp 10.705 to 1 1.134;' 
numbering relative to that of pBHGl 1) and lacks the 1 1.4 kb BamHl fragment containing the 
right arm of the Ad genome found within pBHGl 1. 

To Provide the right arm of ihe Ad genome. pApTP was digested with BamHl 
followed by treatment with shrimp alkaline phosphatase (SAP: U.S. Biochemicals. Cleveland. 
OH). pBHGl 1 was digested with BamHl and the 1 1.4 kb fragment was isolated and purified 
using a QIAEX Gel Extraction Kit as described (Ex. 3). The purified 1 1.4 kb BamHl 
fragment was ligated to the &//»HI/phosphatased pApTP to generate pBHGl lApTP. Proper 
10 construction of pBHGl 1 ApTP was confirmed restriction digestion. 

c) Rescue And Propagation Of Ad Vectors Containing The pTP 
Deletion 

The Ad genome contained within pBHGl lApTP lacks the Ad packaging signals. 

15 Therefore, in order to recover virus containing the 612 bp deletion within the pol gene from 
pBHGl lApol. this plasmid must be cotransfected into packaging cells along with DNA that 
provides a source of the Ad packaging signals. The Ad packaging signals may be provided 
by wild-type or mutant Ad viral DNA or alternatively may be provided using a shuttle vector 
which contains the left-end Ad5 sequences including the packaging signals such as 

20 pAElsplA. pAElsplB (Microbix) or pAdBglll. 

Recombinant Ad vectors containing the pTP deletion which contain a deletion within 
the E3 region are generated by cotransfection of pBHGl lApTP (the gene of interest is 
inserted into the unique Pad site of pBHGl lApTP) with a E3-deleted Ad virus such as 
Ad5dl700 into pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described 

25 above in Example 3. 

Recombinant vectors containing the pTP deletion which also contain deletions within 
the El and E3 regions are generated by cotransfection of pBHGHApTP into pTP-expressing 
293 cells (e.g.. C-7) along with a shuttle vecior containing the left-end Ad5 sequences 
including the packaging signals. Suitable shuttle vectors include pAElsplA (Microbix). 
pAElsplB (Microbix) or pAdBglll. The gene of interest is inserted into the polylinker region 
of the shuttle vector and this plasmid is then cotransfected into B-6 or C-7 cells along with 
pBHGl lApTP to generate a recombinant El", pTP" Ad vector containing the gene of 
interest. 
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EXAMPLE 5 

Production Of Adenovirus Vectors Deleted For 
El. Polymerase And Preterminal Protein Functions 



5 In order 10 Produce an Ad vectors deleted for El. polymerase and preterminal protein 

functions, a deletion encompassing pol and pTP gene sequences was introduced into the Ad 
sequences contained within an El -deleted Ad genome. The plasmid pBHGl 1AE4 was used 
as the source of an El-deleted Ad genome to maximize the cloning capacity of the resulting 
pol - . pTP" vector. pBHGl 1AE4 is a modified form of BHG1 1 which contains a deletion of 

10 all E4 genes except for the E4 ORF 6: the E4 region was deleted to create more room for the 
insertion of a gene of interest and to further disable the virus. However, other El " Ad 
backbones, such as pBHGll and pBHGl (Microbix; pBHGlO contains a smaller deletion 
within the E3 region as compared to pBHGl 1). may be employed for the construction of 
pol". pTP" vectors using the strategy outlined below. 

13 Due 10 tne complexity of the cloning steps required to introduce a 2.3 kb deletion that 

removes portions of both the pTP and pol genes, this deletion was generated using several 
steps as detailed below. 

a) Construction Of pAXBApolApTPVARNA+tl3 

20 ln order l o create a plasmid containing Ad sequences that have a deletion within the 

pol and pTP genes, pAXBApolApTPVARNA+tl3 was constructed as follows. 

pBSA-XB was digested with BspEl and the 18 kb fragment was isolated and 
^circularized to create pAXBApolApTP; this plasmid contains a deletion of the sequences 
contained between the BxpE] sites located at 8.773 and 12.513 (numbering relative to 

25 pBHGll). 

A fragment encoding the VA-RNA3 sequence and the third leader of the tri-partite 
leader sequence was prepared using the PCR as follows. The PCR was carried out in a 
solution containing H5ts36 virion DNA (any Ad DNA. including wild-type Ad. may be used). 
2 ng/mL of primers 4005E and 4006E. 10 mM TrisHCl. pH 8.3. 50 mM KC1. 1.5 mM 
30 MgCl,. 0.001% gelatin and Pfu polymerase. The forward primer. 4005E, [5 r - 

TGCCGCAGCACCGGATGCATC-3" (SEQ ID NO:6)] contains sequences complementary to 
residues 12.551 to 12.571 of pBHGl I (SEQ ID NO:4). The reverse primer. 4006E. [5 - - 
GCGTCCGGAGGCTGCCATG CGGCAGGG-3' (SEQ ID NO:7)] is complementary to 
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residues 1 1 .091 to 1 1.108 ofpBHGll (SEQ ID NO:4) as well as a fl^El sue (underlined). 
The predicted sequence of the -1.6 kb PCR product is listed in SEQ ID NO:8. 

PCR was performed with a Perkin Elmer 9600 Thermocycler utilizing the following 
cycling parameters: initial denaturation at 94°C for 3 min. 3 cycles of denaturation at 94°C 
for 30 sec. annealing at 50°C for 30 sec. and extension at 72°C for 60 sec. followed by 
another 27 cycles with an increased annealing temperature at 56°C. with a final extension at 
72°C for 10 minutes. The -1.6 kb PCR product was purified using a QIAEX Gel Extraction 
Kit as described (Ex. 3). 

The purified PCR fragment was then digested with BspEl. pAXBApolApTP was 
digested with BspEl followed by treatment with SAP. The ^El-digested PCR fragment and 
the AypEI-SAP-treated pAXBApolApTP were ligated together to create 
pAXBApolApTPVARNA+tl3. 

b) Construction Of pBHGHApolApTPVARNA+tl3 

pAXBApolApTPVARNA+tlS contains a 2.3 kb deletion within the pol and pTP genes 
(bp 8.773 to 1 1.091: numbering relative to that of pBHGl I) and lacks the 1 1.4 kb BamHl 
fragment containing the right arm of the Ad genome. 

To provide the right arm of the Ad genome, pAXBApolApTPVARNA+tl3 was 
digested with BamHl followed by treatment with SAP. pBHGl 1AE4 (pBHGl 1 or pBHGlO 
may be used in place of pBHGl 1AE4) was digested with BamHl and the 1 1.4 kb fragment 
was isolated and purified using a QIAEX Gel Extraction Kit as described (Ex. 3). The 
purified 1 1 .4 kb BamH] fragment was ligated to the BomHI/phosphatased 
pAXBApolApTPVARNA+tl3 to generate pBHGl 1 ApolApTPVARNA+tl3. Proper 
construction of pBHGl 1 ApolApTPVARNA+tl3 was confirmed restriction digestion. 

25 

c) Rescue And Propagation Of Ad Vectors Containing The pol, 
pTP Double Deletion 

The Ad genome contained within pBHG 1 1 ApolApTPVARNA+t 1 3 lacks the Ad 
packaging signals. Therefore, in order to recover virus containing the 2.3 kb deletion within 
30 the pol and pTP genes from pBHGl lApolApTPVARMA+tl3. this plasmid must be 
cotransfected into packaging cells along with DNA that provides a source of the Ad 
packaging signals. The Ad packaging signals may be provided by wild-type or mutant Ad 
viral DNA or alternatively may be provided using a shuttle vector which contains the left-end 
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Ad5 sequences including the packaging signals such as pAEIsplA. pAElspIB (Microbix) or 
pAdBglll. 

Recombinant vAd vectors containing the pol. pTP double deletion which also contain a 
deletion within the E3 region are generated by cotransfection of 
5 pBHGl 1 ApolApTPVARNA+tl3 (the gene of interest is inserted into the unique Pad site of 
pBHGl 1 ApolApTPVARNA+U3) with a E3-deleted Ad virus such as Ad5<//7001 into pol- and 
pTP-expressing 293 cells (e.g.. C-7); viral DNA-TPC are utilized as described (Ex. 3). 

Recombinant vectors containing the poL pTP double deletion which also contain 
deletions within the El and E3 regions are generated by cotransfection of 
10 pBHGl !ApolApTPVARNA+tl3 into pol- and pTP-expressing 293 cells (e.g.. C-7) along with 
a shuttle vector containing the left-end Ad5 sequences including the packaging signals. 
Suitable shuttle vectors include pAEIsplA (Microbix), pAElspIB (Microbix) or pAdBglll. 
The gene of interest is inserted into the polylinker region of the shuttle vector and this 
plasmid is then cotransfected into B-6 or C-7 cells along with 
15 pBHGl !ApolApTPVARNA+tl3 to generate a recombinant El", pol pTP" Ad vector 
containing the gene of interest. 

EXAMPLE 6 

Encapsidated Adenovirus Minichromosomes 
20 Containing A Full Length Dystrophin cDNA 

In this Example, the construction of an encapsidated adenovirus minichromosome 
(EAM) consisting of an infectious encapsidated linear genome containing Ad origins of 
replication, packaging signal elements, a [3-galactosidase reporter gene cassette and a full 

25 length (14 kb) dystrophin cDNA regulated by a muscle specific enhancer/promoter is 

described. EAMs are generated by cotransfecting 293 cells with supercoiled plasmid DNA 
(pAdSPdys) containing an embedded inverted origin of replication (and the remaining above 
elements) together with linear DNA from El -deleted virions expressing human placental 
alkaline phosphatase (hpAP). All proteins necessary for the generation of EAMs are provided 

30 in trans from the hpAP virions and the two can be separated from each other on equilibrium 
CsCl gradients. These EAMs are useful for gene transfer to a variety of cell types both in 
vitro and in vivo. 
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a) Generation And Propagation Of Encapsidatcd Adenovirus 
iMinichromosomes 

To establish a vector system capable of delivering full length dystrophin cDNA clones, 
the minimal region of Ad5 needed for replication and packaging was combined with a 
5 conventional plasmid carrying both dystrophin and a P-galactosidase reporter gene. In the 
first vector constructed, these elements were arranged such that the viral ITRs flanked (ITRs 
facing outward) the reporter gene and the dystrophin gene [i.e.. the vector contained from 5" 
to 3" the right or 3 - ITR (mu 100 to 99). the dystrophin gene and the reporter gene and the 
left or 5" ITR and packaging sequnce (mu 1 to 0)]. Upon introduction of this vector along 
10 with helper virus into 293 cells, no encapsidated adenovirus minichromosomes were 
recovered. 

The second and successful vector. P Ad5(3dys (Fig. 10) contains 2.1 kb of adenovirus 
DNA. together with a 14 kb murine dystrophin cDNA under the control of the mouse muscle 
creatine kinase enhancer/promoter, as well as a p-galactosidase gene regulated by the human 

1 5 cytomegalovirus enhancer/promoter. 

Figure 10 shows the structure of pAd5pdys (27.8 kb). The two inverted adenovirus 
origins of replication are represented by a left and right inverted terminal repeat 
(LITR/RITR). Replication from these ITRs generates a linear genome whose termini 
correspond to the 0 and 100 map unit (mu) locations. Orientation of the origin with respect 

20 to wild type adenovirus serotype 5 sequences in mu is indicated above the figure (1 mu=360 
bp). Encapsidation of the mature linear genome is enabled by five (AI-AV) packaging signals 
OP). The E. coli P-galactosidase and mus musculus dystrophin cDNAs are regulated by 
cytomegalovirus (CMV) and muscle creatine kinase (MCK) enhancer/promoter elements, 
respectively. Both expression cassettes contain the SV40 polyadenylation (pA) signal. Since 

25 the El A enhancer/promoter overlaps with the packaging signals. pAd5pdys was engineered 
such that RNA polymerase transcribing from the El A enhancer/promoter will encounter the 
SV40 late polyadenylation signal. Pertinent restriction sites used in constructing pAd5pdys 
are indicated below the figure. An adenovirus fragment corresponding to mu 6.97 to 7.77 
was introduced into pAdSPdys during the cloning procedure (described below). PI and P2 

30 represent location of probes used for Southern blot analysis. Restriction sites destroyed 
during the cloning of the Ad5 origin of replication and packaging signal are indicated in 
parentheses. 
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pAd5(Jdys was constructed as follows. pBSX (Ex. 3a) was used as the backbone for 
construction of pAdSpdys. The inverted fused Ad5 origin of replication and five 
encapsidation signals were excised as a PsiMXbal fragment from pAd5ori. a plasmid 
containing the 6 kb Hindi]] fragment from pFG140 (Microbix Biosystems). This strategy also 
5 introduced a 290 bp fragment from Ad5 corresponding to map units 6.97 to 7.77 adjacent to 
the right inverted repeat (see Fig. 10). The E. coli P-galactosidase gene regulated by the 
human CMV immediate early (IE) promoter/enhancer expression cassette was derived as an 
EcoJU/HindlU fragment from pCMVp [MacGregor and Caskey (1989) Nucleic Acids Res. 
17:2365]: the CMV IE enhancer/promoter is available from a number of suppliers as is the £. 

10 coli p-ealactosidase gene]. The murine dystrophin expression cassette was derived as a 
BssHll fragment from pCVAA. and contains a 3.3 kb MCK promoter/enhancer element 
[Phelps ei al. (1995) Hum. Mol. Genet. 4:1251 and Jaynes ei al. (1986) Mol. Cell. Biol. 
6:2855]. The sequence of the -3.3 kb MCK promoter/enhancer element is provided in SEQ ID 
NO:9 (in SEQ ID NO:9. the last nucleotide of SEQ ID NO:9 corresponds to nucleotide +7 of 

15 the MCK gene). The enhancer element is contained within a 206 bp fragment located -1256 
and - 1 050 upstream of the transcription start site in the MCK gene. 

It was hoped that the pAd5pdys plasmid would be packageable into an encapsidated 
minichromosome when grown in parallel with an El -deleted virus due to the inclusion of both 
inverted terminal repeats (ITR) and the major Ad packaging signals in the plasmid. The ITRs 

20 and packaging signals were derived from pFG140 (Microbix). a plasmid that generates El- 
defective Ad particles upon transfection of human 293 cells. 

hpAP is an El deleted Ad5 containing the human placental alkaline phosphatase gene 
[Muller ei al. (1994) Circ. Res. 75:1039]. This virus was chosen to provide the helper 
functions so that it would be possible to monitor the titer of the helper virus throughout serial 

25 passages by quantitative alkaline phosphatase assays. 

293 cells were cotransfected with pAd5pdys and hpAP DNA as follows. Low passage 
293 cells (Microbix Biosystems) were grown and passaged as suggested by the supplier. Five 
pAd5pdys and hpAP DNA (5 and 0.5 u.g, respectively) were dissolved in 70 u.1 of 20 mM 
HEPES buffer (pH 7.4) and incubated with 30 ul of DOTAP (BMB) for 15 min. at room 

30 temperature. This mixture was resuspended in 2 mis of DMEM supplemented with 2% fetal 
calf serum (FCS) and added dropwise to a 60 mm plate of 293 cells at 80% confluency. Four 
hours post-transfection the media was replaced by DMEM with 10% FCS. Cytopathic effect 
was observed 6-12 days post-transfection. 



- 48 - 



10 



20 



30 



W0 98/17783 PCT/US97/19541 

Coiransiection of 293 cells with supercoiled pAdSpdys and linear hpAP DNA 
produced AdSpdys EAMs (the encapsidated version of pAdSpdys) approximately 6 to 12 days 
post transfection. as evidenced by the appearance of a cytopathic effect (CPE). Initially, the 
amount of AdSpdys EAMs produced was significantly lower than that of hpAP virions" so the 
viral suspension was used to re-infect fresh cultures, from which virus was isolated and used 
for serial infection of additional cultures (Fig. 11). Infection and serial passaging of 293 cells 
was carried out as follows. 

Lysate from one 60 mm plate of translated 293 cells was prepared by vigorously 
washing the cells from the plate and centrifuging at I K rpm in a clinical centrifuge. Cells 
were resuspended in DMEM and 2% FCS. freeze-thawed in a dry ice-ethanol bath! cell debris 
removed by centrifugation. and approximately 75% of the crude lysate was used to infect 293 
cells in DMEM supplemented with 2% FCS for 1 hour and then supplemented with 10% FCS 
thereafter. Infection was allowed to proceed for 18-20 hrs before harvesting the virus. The 
total number of cells infected in each serial passage is indicated in Figure 1 1. 

In Figure 11. the total number of transducing adenovirus particles produced (output) 
per serial passage on 293 ceils, total input virus of either the helper (hpAP) or Ad5pdys. and 
the total number of cells used in each infection is presented. The total number of input/output 
transducing particles were determined by infection of 293 cells plated in 6- well microliter 
plates. Twenty four hours post-infection the cells were assayed for alkaline phosphatase or P- 
galactosidase activity (described below) to determine the number of cells transduced with 
either AdSpdys or hpAP. The number of transducing particles were estimated by 
extrapolation of the mean calculated from 31 randomly chosen 2.5 mm : sectors of a 961 mnr 
plate. The intra-sector differences in total output of each type of virus are presented as the 
standard deviations, c. in Figure 11. For each serial passage. 75% of the total output virus 
from the previous passage was used for infection. 

Alkaline phosphatase or p-galactosidase activity was determined as follows. For 
detection of alkaline phosphatase, infected 293 cells on Petri dishes were rinsed twice with 
phosphate buffered saline (PBS) and fixed for 10 minutes in 0.5% glutaraldehyde in PBS. 
Cells were again rinsed twice with PBS for ten minutes followed by inactivation of 
endogenous alkaline phosphatase activity at 65°C for 1 nr. in PBS prior to the addition of the 
chromogenic substrate BCIP (5-bromo-4-chloro-3-indolyl phosphate) at 0.15 mg/ml and nitro 
blue tetrazolium at 0.3 mg/ml) . Cells were incubated at 37°C in darkness for 3- 24 hrs. For 
P-galactosidase assays, the cells were fixed and washed as above, then assayed using standard 
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methods [MacGregor et al (1991) In Murray (ed.). Methods in Molecular Biolog\: Vol ". 

Gene Transfer and Expression Protocols. Human Press Inc.. Clifton. NJ. pp. 217-225]. 
As shown in Figure 1 1. the rate of increase in titer of Ad5(idys EAMs between 

transfection and serial passage 6 was approximately 100 limes greater than that for hpAP 
5 virions. This result indicated that Ad5pdys has a replication advantage over the helper virus. 

probably due to the shorter genome length (a difference of approximately 8 kb) and hence an 

increased rate of packaging. 

Interestingly, after serial passage 6 there was a rapid decrease in the total titer of hpAP 

virions whereas the titer of Ad5pdys EAMs continued to rise. While not limiting the present 
10 invention to any particular mechanism, at least two possible mechanisms could be responsible 

for this observation. Firstly, a buildup of defective hpAP virions due to infections at high 

multiplicities may slowly out-compete their full length counterparts, a phenomenon that has 

been previously observed upon serial propagation of adenovirus [Daniell (1976) J. Virol. 

19:685: Rosenwirth et al. (1974) Virology 60:431: and Burlingham et al. (1974) Virology 
15 60:419]. Secondly, the emergence of replication competent virions due to recombination 

events between El sequences in cellular DNA and the hpAP genome could lead to a buildup 

of virus particles defective in expressing alkaline phosphatase fLochmuller et al. (1994) Hum. 

Gene Ther. 5:1485]. 

Southern analysis of DNA prepared from serial lysates 3. 6. 9 and 12 indicated that 
20 full length dystrophin sequences were present in each of these lysates (Fig. 12A). In addition, 
the correct size restriction fragments were detected using both dystrophin and p-ealactosidase 
probes against lysate DNA digested with several enzymes (Figs. 12A-B). 

Figure 12 shows a Southern blot analysis of viral DNA from lysates 3. 6. 9 and 12. 
digested with the restriction enzymes Bsslill. Nrul and £coRV. indicating the presence of a 
25 full length dystrophin cDNA in all lysates. Fragments from the C terminus of mus musculus 
dystrophin cDNA (A) or the N terminus of E coli p-galactosidase (B) were labeled with 
dCTP j: and used as probes [Sambrook et al. supra). The position of these probes and the 
predicted fragments for each digest is indicated in Figure 10. Note that one end of each 
fragment (except the 17.8 kb BssH\] dystrophin fragment) detected is derived from the end of 
30 the linearized Ad5pdys genome (see Fig. 10). Low levels of shorter products, presumably 
derived from defective virions, become detectable only at high serial passage number. 

At the later passages (9 and 12) there appeared to be an emergence of truncated 
AdSpdys sequences, suggesting that deletions and/or rearrangements may be occurring at later 
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passages. Hence, mosi experiments were performed with Ad5Pdys EAMs derived from the 
earlier passages. The possibility of the emergence of replicat.on competent (El -containing) 
viruses was also examined by infection of HeLa cells by purified and crude serial lysates.^ 
none of which produced any detectable CPE. 

b) Purification Of Encapsidated Adenovirus Minichromosomes 
On CsCI Gradients 

The ability to separate Ad5(3dys EAMs from hpAP virions based on their buoyancy 
difference (due presumably to their different genome lengths) on CsCI gradients was 
examined. Repeated fractionation of the viral lysate allows small differences in buoyancy to 
be resolved. CsCI purification of encapsidated adenovirus minichromosomes was performed 
as follows. Approximately 25% of the lysate prepared from various passages during serial 
infect.ons was used to purify virions. Freeze-thawed lysate was centrifuged to remove the 
cell debris. The cleared lysate was extracted twice with 1.1.2 tricholorotrifluoroethane 
(Sigma) and applied to CsCI step and self forming gradients. 

Purification of virus was initially achieved by passing it twice through CsCI step 
gradients with densities of p=1.45 and p=1.20 in a SW28 rotor (Beckman). After isolation of 
the major band in the lower gradient, the virus was passed through a self forming gradient 
(initial p=1.334) at 37.000 rpm for 24 hrs followed by a relaxation of the gradient by 
reducing the speed to 10.000 rpm for 10 hrs. in a SW41 rotor (Beckman) at 12°C [Anet and 
Strayer (1969) Biochem. Bicphys. Res. Commun. 34:328]. The upper band from the gradient 
(composed mainly of AdSpdys virions) was isolated using an 18 gauge needle, reloaded on a 
fourth CsCI gradient (p=1.334) and purified at 37.000 rpm for 24 hrs followed by 10.000 rpm 
for 10 hrs at 12°C. 

The AdSpdys-containing CsCI band was removed in 100 pi fractions from the top of 
the centrifugation tube and CsCI was removed by chromatography on Sephadex G-50 
(Pharmacia). Aliquots from each fraction were used to infect 293 cells followed by p. 
galactosidase and alkaline phosphatase assays to quantitate the level of contamination by 
hpAP virions in the final viral isolate. 

Results of the physical separation between Ad5pdys EAMs and hpAP virions are 
shown in Figure 13. Figure 13 shows the physical separation of AdSpdys from hpAP virions 
at the third (A) and final (B) stages of CsCI purification (initial p- 1.334) in a SW41 tube. 
Aliquots of Ad5pdys EAMs from the final stage were drawn through the top of the 
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centrifugation tube and assayed for P-galactosidase and alkaline phosphatase expression. 
Results of these assays are presented in Figure 14. 

In Figure 14. shows the level of contamination of AdSpdys EAMs by hpAP virions 
obtained from passage 6. Following four cycles of CsCl purification, aliquots were removed 
5 from the top of the centrifugation tube and used for infection of 293 cells, which were fixed 
24 hrs later and assayed for P-galactosidase or alkaline phosphatase activity (described below). 
The number of transducing particles are an underestimation of the actual totals as in some 
cases a positive cell may have been infected by more than one transducing particle. The ratio 
of the two types of virions - AdSpdys EAMs (LacZ) or hpAP (AP) in each fraction is 
10 indicated in the lower graph. 

The maximum ratio of transducing AdSpdys EAMs to hpAP virions reproducibly 
achieved in this study was 24.8 - a contamination with helper virus corresponding to 
approximately 4% of the final viral isolate. 

c) Dystrophin And /J-galactosidase Expression By Encapsidated 
Adenovirus Minichromosomes In Muscle Cells 

To determine if the AdSpdys EAMs were able to express p-galactosidase and 
dystrophin in muscle cells, mouse mdx myogenic cultures were infected with CsCl purified 
EAMs. 

i) Propagation And Infection Of Muscle Cells 

MM 14 and mdx myogenic cell lines were kindly provided by S. Hauschka (University 
of Washington) and were cultured as previously described [Linkhart ei ai (1981) Dev. Biol. 
86:19 and Clegg et al. (1987) J. Cell Biol. 105:949]. Briefly, myoblasts were grown on 
plastic tissue culture plates coated with a 0.1% gelatin in Ham's F-10 medium containing 15% 
(v/v) horse serum. 0.8 mM CaCK. 200 ng/ml recombinant human basic fibroblast growh 
factor (b-FGF) and 60 pg/ml genitimicin (proliferation medium). Cutures were induced to 
differentiate by switching to growth in the presence of growth medium lacking b-FGF and 
containing 10% horse serum (differentiation medium). Myoblasts or differentiated myotubes 
(three days post switching) were infected at a multiplicity of infection of 2.2 AdSpdys EAMs 
per cell. Fractions containing minimal contamination with hpAP virions (3. 4 and 5 of 
passage 6) were used for western and immunofluorescence analysis. Infection was allowed to 
proceed for 3 days for both the myoblasts and myotubes before, harvesting cells. 
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ii) Total Protein Extraction And Immunoblot 
Analysis 

For protein extraction, muscle cells were briefly trypsinized. transferred to a 
microcentrifuge tube, centred at 14 K. for 3 min at room temp and .suspended two times 
m PBS. After an additional centrifugation. the cell pellet was .suspended in 80 ul of RJPA 
buffer (50 mM Tris-Cl. P H 7.5; 150 mM NaCI: 1% Nonidet P-40: 0.5% sodium 
deoxycholate: 0.1% SDS) (Sambrook e, ai. supra). The sample was briefly sheared using a 
22 gauge needle to reduce viscosity and total protein concentration assayed using the 
bicinccninic acid protein assay reagent (Pierce. Rockford. IL). Expression of full length 
dystrophin or P-galactosidase in infected mdx and MM 14 myoblasts or myotubes was 
analyzed by electrophoresis of 40 ug of total protein extract on a 6% SDS-PAGE gel (in 25 
mM Tris. 192 mM glycine. 10 mM P-mercaptoethanol. 0.1% SDS). After transferring to"" 
Gelman Bictrace NT membrane (in 25 mM Tris. 192 mM e lycine. 10 mM P-mercaptoethanol. 
0.05% SDS. 20% methanol), the membrane was blocked with 5% non-fat milk and 1% goat- 
serum in Tris-buffered saline-Tween (TBS-T) for 12 hrs at 4°C. Immunostaining was done 
according to the protocol for the ECL western blotting detection reagents (Amersham Life 
Sciences. Buckingham. UK). The primary antibodies used were Dys-2 (Vector Laboratories) 
and anti-p-galactosidase (8MB. Indianapolis. IN) with a horseradish peroxidase-conjugated 
anti-mouse secondary antibody. 

Western blot analysis of EAM-infected mdx myoblasts and myotubes (three days post- 
fusion) indicated that EAMs were able to infect both of these cell types (Figure 15). In 
Figure 15. immunoblots of protein extracts from mdx myoblasts and myotubes demonstrating 
the expression of P-galactosidase (A) and dystrophin (B) in cells infected with Ad5pdys 
EAMs. Total protein was extracted 3 days post infection in all cases. Myotubes were 
infected at three days following a switch to differentiation media. In Figure 15A, lane 1 
contains total protein extract from 293 cells infected with a virus expressing P-galactosidase 
as. a control (RSV-LacZ); lanes 2-5 contain total protein extracts from uninfected mdx 
myoblasts, mdx myoblasts infected with Ad5Pdys EAMs, mdx myotubes and mdx myotubes 
derived from mdx myoblasts infected with Ad5pdys EAMs. respectively. In Figure 15B. 
lanes I and 7 contain total protein from mouse muscle ("C57") while lane 2 contains protein 
from wild type MM14 myotubes. as controls. Lanes 3-5 contain total protein extracts from 
uninfected mdx myoblasts, mdx myoblasts infected with Ad5pdys EAMs. mdx myotubes and 
mdx myotubes derived from mdx myoblasts infected with Ad5pdys EAMs. respectively. 
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As shown in Figure 15. expression of P-galactosidase was detected in both the infected 
max myoblasts and myotubes indicating that the CMV promoter was active at both these earlv 
stages of differentiation in muscle cells. However, only infected mdx myotubes produced 
protein detected by dys-2. an antibody recognizing the 17 C-terminal amino acids of 
dystrophin (Fig. 15). No dystrophin expression was detected in infected myoblasts by western 
analysis, indicating that the muscle creatine kinase promoter functions minimally, if at all. 
within the AdSpdys EAM prior to terminal differentiation of these cells. 

Dystrophin expression in mdx cells infected with EAMs was confirmed by 
immunofluorescence studies using N-terminal dystrophin antibodies. In agreement with the 
western analysis, dystrophin expression from the MCK promoter was detected only in 
differentiated mdx myotubes infected by AdSpdys EAMs (Fig. 16). Figures I6A-C show 
immunofluorescence of dystrophin in wild type MM14 myotubes (A), uninfected mdx (B) and 
infected mdx myotubes (C), respectively. The results shown in Figure 16 demonstrate the 
transfer and expression of recombinant dystrophin to differentiated mdx cells by AdSpdys 
EAMs. 

Immunofluorescence of myogenic cells was performed as follows. Approximately 1.5 
x 10 h MM14 or mdx myoblasts were plated on Poly-L-lysine (Sigma) coated glass slides (7 x 
3 cm) which had been previously etched with a 0.05% chromium potassium sulfate and 0.1% 
gelatin solution. For myotube analysis, the cultures were switched to differentiation media 
fClegg et ai (1987), supra] 48 hours after plating, immediately infected and then allowed to 
fuse for 3 days, whereas myoblasts were continuously propagated in proliferation media 
fClegg et ai (1987). supra]. Cells were washed three limes with PBS at room temperature 
and iixed in 3.7% formaldehyde. For immunostaining. cells were incubated in 0.5% Triton 
X-100. blocked with 1% normal goat serum and incubated with an affinity purified antibody 
against the N-terminus of murine dystrophin for 2 hrs. followed by extensive washing in PBS 
and 0.1% Tween-20 with gentle shaking. Cells were incubated with a 1:200 dilution of biotin 
conjugated anti-rabbit antibody (Pierce) for one hour and washed as above. Cells were 
further incubated with a 1:300 dilution of streptavidin-fluorescein isothiocyanate conjugate 
(Vectorlabs. Burlingam. CA) for one hour and washed as above, followed bv extensive 
washing in PBS. 

The above results show that embedded inverted Ad origins of replication coupled to an 
encapsidation signal can convert circular DNA molecules to linear forms in the presence of 
helper virus and that these genomes can be efficiently encapsidated and propagated to high 
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mers. Such viruses can be purified on a CsCI gradient and maintain their ability to transduce 
cells /„ vara, and their increased cloning capacity allows the inclusion of large genes and 
.issue specific gene regulatory elements. The above results also show that the dystrophin ,ene 
was expressed in cells transduced by such viruses and that the protein product was correctly 
localized to the cell membrane. The above method for preparing EAMs theoretically enables 
virtually any gene of interest to be inserted into an infectious minichromosome bv 
conventional cloning in plasmid vectors, followed by cotransfection with helper viral DNA in 
293 cells. This approach is useful for a variety of gene transfer studies in vitro. The 
observation that vectors completely lacking viral genes can be used to transfer a full-length 
dystrophin cDNA into myogenic cells indicates that this method may be used for the 
treatment of DMD using gene therapy. 

d) A Modified MCK Enhancer Increases Expression Of Linked 
Genes In Muscle 

The DNA fragment containing enhancer/promoter of the MCK gene utilized in the 
AdSpdys EAM plasmid is quite large (-3.3. kb). In order to provide a smaller DNA fragment 
capable of directing high levels of expression of linked genes in muscle cells, portions of the 
3.3 kb MCK enhancer/promoter were deleted and/or modified and inserted in front of a 
reporter gene {lacZ). The enhancer element of the MCK gene was modified to produce the 
2RS5 enhancer: the sequence of the 2RS5 enhancer is provided in SEQ ID NO:10. The first 
6 residues of SEQ ID NO: 1 0 represent a Kpn\ site added for ease of manipulation of the 
modified MCK enhancer element. Residue number 7 of SEQ ID NO:10 corresponds to 
residue number 2164 of the wild-type MCK enhancer sequence listed in SEQ ID N0:9 
(position 2164 of SEQ ID N0:9 corresponds to position -1256 of the MCK uene). Residue 
number 174 of SEQ ID NO: 10 corresponds to residue number 2266 of the wild-type MCK 
enhancer sequence listed in SEQ ID NO:9. 

These MCK/lacZ constructs were used to transfect cells in culture (/.*.. myogenic 
cultures) or were injected as naked DNA into the muscle of mice and P-galactosidase activity 
was measured. Figure 17 provides a schematic of the MCK/lacZ constructs tested. The first 
construct shown in Figure 17 contains the 3.3. kb wild-type MCK enhancer/promoter 
fragment linked to the E. coli lacZ gene. The wild-type enhancer element (-1256 to -1056) 
is depicted by the box containing "E": the core promoter element (-358 to -80) is indicated by 
the light cross-hatching and the minimal promoter element (-80 to +7) is indicated by the dark 
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cross-hatching. The core promoter element is required in addition to the minimal promoter 
element (which is required for basal expression) in order to achieve increased muscle-specific 
expression in tissue culture [Shield ei al. (1996) Mol. Cell. Biol. 16:5058]. The modified 
enhancer element, the 2RS5 enhancer, is indicated by the box labeled "E*." The box labeled 
"mi nx" contains a synthetic intron derived from adenovirus and is used to increase expression 
of the constructs (any intron may be utilized for this purpose). The box labeled "CMV" 
depicts the CMV IE enhancer/promoter which was used a positive control. 

In figure 17. 3-gaiactosidase activity is expressed relative to the activity of the wild- 
type MCK enhancer/promoter construct shown at the top of the figure. As shown in Figure 

10 17. a construct containing the 2RS5 enhancer (SEQ ID NO: 10) linked to either the minimal 

MCK promoter (a - 261 bp element) or the core and minimal MCK promoter elements (a -539 
bp element) directs higher levels of expression of the reporter gene in muscle cells as 
compared to the -3.3 kb fragment containing the wild-type enhancer element. These modified 
enhancer/promoter elements are considerably smaller than the -3.3 kb fragment used in the 

15 Ad5|3dys EAM plasmid and are useful for directing the expression of foreign genes in muscle 
cells. These smaller elements are particularly useful for driving the expression of genes in the 
context of self-propagating adenoviral vectors which have more severe constraints on the 
amount of foreign DNA which can be inserted in comparison to the used of "gutted" 
adenoviruses such as the EAMs described above. 

20 

EXAMPLE 7 

Generation Of High Titer Stocks Of Encapsidated Adenovirus 
Minichromosomes Containing Minimal Helper Virus Contamination 

25 The results presented in Example 6 demonstrated that encapsidated adenovirus 

minichromosomes (EAMs) can be prepared that lack all viral genes and which can express 
full-length dystrophin cDNAs in a muscle specific manner. The propagation of these EAMs 
requires the presence of helper adenoviruses that contaminate the final EAM preparation with 
conventional adenoviruses (about 4% of the total preparation). In this example the EAM 

30 system is modified to enable the generation of high titer stocks of EAMs with minimal helper 
virus contamination. Preferably the EAM stocks contain helper virus representing less than 
1%. preferably less than 0.1% and most preferably less than 0.01% of the final viral isolate. 
Purified EAMs are then injected in vivo in muscles of dystrophin minus mdx mice to 
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determine whether these vectors lead to immune rejection and whether they can alleviate 
dystrophic symptoms in mdx muscles. 

The amount of helper virus present in the EAM preparations is reduced in two ways. 
The first is by selectively controlling the relative packaging efficiency of the helper virus 
versus the EAM virus. The second is to improve physical methods for separating EAM from 
helper v irus. These approaches enable the generation of dystrophin-expressing EAMs that are 
contaminated with minimal levels of helper virus. 

a) Development And Characterization Of Adenovirus Packaging 
Cell Lines Expressing The Cre Recombinase From 
Bacteriophage PI 

Cell lines expressing a range of Cre levels are used to optimize the amount of helper 
virus packaging that occurs during growth of the EAM vectors. The Cre-loxP system [Sauer 
and Henderson (1988) Proc. Natl. Acad. Sci. USA 85:5166] is employed to selectively disable 
helper v irus packaging during growth of EAMs. The bacterial Cre recombinase catalyzes 
efficient recombination between a 34 bp target sequence called loxP. To delete a desired 
sequence. loxP sites are placed at each end of the sequence to be deleted in the same 
orientation: in the presence of Cre recombinase the intervening DNA segment is efficiently 
excised. 

Cell lines expressing a range of Cre recombinase levels are generated. The expression 
of too linle Cre protein may result in high levels of helper virus being generated, which leads 
to unacceptably high levels of helper virus contaminating the final EAM preparation. If very 
high lev els of Cre expression are present in a cell line, excision of the packaging signal from 
the helper virus would be 100% efficient (/.*.. it would completely prevent helper virus 
packaging). As shown in Example 6. serial passage of EAM preparations containing low 
levels of helper virus increased the titer of the EAM. Therefore, it is desirable, at least in the 
initial passages of a serial passage that some helper virus capable of being packaged is 
present. A low level of packagable helper virus may be provided by using a cell line 
expressing levels of Cre recombinase which are not high enough to achieve excision of the 
packaging signals from 100% of the helper virus: these cell lines would be used early in the 
serial passaging of the EAM stock and a cell line expressing high enough levels of Cre 
recombinanse to completely prevent helper virus packaging would be used for the final 
passage. 
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Alternatively. EAMs may be prepared using a packaging cell line that supports high 
efficiency Cre recombinase-mediated excision of the packaging signals from the helper virus 
by transfection of the CVe-expressing cell line with the EAM plasmid followed by infection of 
these cells with /avP-comaning helper virus using an MOI of > 1.0. 

Human 293 cell lines that express a variety of levels of Cre recornbinase are generated 
as follows. An expression vector containing the Cre coding region. pOG231. was 
cotransfected into 293 cells along with pcDNA3 (Invitrogen): pcDNAS contains the neo gene 
under the control of the SV40 promoter. pOG231 uses the human CMV enhancer/promoter 
[derived from CDM8 (Invitrogen)] to express a modified Cre gene [pOG231 was obtained 
from S. CTGorman]. The modified Cre gene contained within pOG231 has had a nuclear 
localization signal inserted into the coding region to increase the efficiency of recombination. 

pOG231 was constructed as follows. A BglU site was introduced into the 5" Xba\ site 
of the symhethic intron of pMLSISCAT [Haung and Gorman (1990) Nucleic Acids Res. 
18:937] by linker tailing. A BglU site in the symhethic intron of pMLSISCAT was destroyed 
and a BamHl linker was inserted into the Pstl site at the 3" end of the symhethic intron in 
pMLSISCAT. BglU and Smal sites and a nuclear localization signal were introduced into the 
5* end of pMC-Cre [Gu et ai (1993) Cell 73:1 155] using a PCR fragment that extended from 
the novel BglU and Smal sites to the BamHl site in the Cre coding region. This PCR 
fragment was ligated to a BamHMSall fragment containing a portion of the Cre coding region 
derived trom pIC-Cre (Gu et ai, supra) and the intron plus Cre coding sequence was inserted 
into a modified form of pOG44 [O'Gorman et ai (1991) Science 251:1351] to generate 
pOG23L The predicted sequence of pOG231 from the BglU site to the BamWl site located in 
the middle of the Cre coding sequence is listed in SEQ ID NO:] 1. 

One 60 mm dish of 293 cells (Microbix) were transfected with 10 jag of PvuII- 
linearized pOG231 and 1 \x% of Afo/I-linearized pcDNA3 using a standard calcium phosphate 
precipitation protocol. Two days after the addition of DNA. the transfected cells were split 
into three 100 mm dishes and 1000 ng/ml of active G418 was added to the medium. The 
cells were fed periodically with G418-contaning medium and three weeks later. 24 G418- 
resistant clones were isolated. 

The isolated clones were expanded for testing. Aliquots were frozen in liquid nitrogen 
at the earliest possible passage. The neomycin resistant cell lines were examined for the 
expression of Cre recornbinase using following transfection assay. The neomycin resistant 
cells were transfected with PGK-l-GFP-lacZ (obtained from Sally Camper. Univ. of 
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Michigan. Ann Arbor. MI), which contains a green fluorescent protein (GFP) expression 
cassette that can be excised by Cre recombinase to allow expression of P-galactosidase: 
transfection was accomplised using standard calcium phosphate precipitation. Figure 18 
provides a schematic of a GFP/p-gal reporter construct suitable for assaying the expression of 
Cre recombinase in mammalian cells: GFP sequences and P-gal (i.e.. lacZ) sequences are 
avaialbale from commmerical sources (e.g.. Clonetech. Palo Alto. CA and Pharmacia Biotech. 
Piscataway. NJ. respectively). 

Control experiments verified that 293 cells transfected with PGK- 1-GFP-lacZ 
expressed significant amounts of P-galactosidase only if these cells also expressed Cre 
recombinanse. p-galactosidase assays were performed as described in Example 6. Neomycin 
resistant cells expressing Cre recombinase were grouped as high, medium or low expressors 
based upon the amount of P-galactosidase activity produced (estimated by direct counting of 
P-galactosidase-positive cells per high-power Held and by observing the level of staining and 
the rapidity with which the blue stain was apparent) when these cell lines were transfected 
with the GFP/p-gal reporter construct. Thirteen positive (i.e.. Oc-expressing) lines. D608#12. 
#7. #22. #18. #17. #4, #8. #2, #2/2. #13. #5. #15 and #21 were retained for further use. 

The results of this transfection analysis revealed that cultures of 293 cells expressing 
medium to high levels of Cre recombinase could be generated without apparent toxicity. 

b) Generation Of Helper Adenovirus Strains That Contain loxP 
Sites Flanking The Adenovirus Packaging Signals 

Studies of EAM production demonstrated that the EAM vector has a packaging 
advantage over the helper adenovirus (Ex. 6). While not limiting the present invention to a 
particular mechanism, it is hypothesized that this packaging and replication advantage can be 
greatly increased by using helper viruses that approach the packaging size limits of Ad5 [Bett 
ei al. (1993) J. Virol. 67:591 1], by using viruses with mutations in E4 and/or E2 genes [Yeh 
ei al. (1996) J. Virol. 70:559; Gorziglia ex al. (1996) J. Virol. 70:4173: and Amalfitano et al. 
(1996) Proc. Natl. Acad. Sci. USA 93:3352]. by inclusion of mutations or alterations in the 
packaging signals of the helper virus [Imler ei al. (1995) Hum. Gene Ther. 6:71 1] and by 
combining these strategies. 

The Cre-loxP excision method is used to disable the packaging signals from the helper 
virus genomes. The Ad5 packaging domain extends from nucleotide 194 to 358 and is 
composed of five distinct elements that are functionally redundant [Hearing ei al. (1987) J. 
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Virol. 61:2555]. Theoretically, any molecule containing the Ad5 origin of replication and 
packaging elements should replicate and be packaged into mature virions in the presence of 
non-defective helper virus. Disabling the packaging signals should allow replication and gene 
expression to proceed, but will prevent packaging of viral DNA into infectious particles. This 
5 in turn should allow the ratio of EAM to helper virus to be increased greatly. 

To disable the packaging signals within the helper virus used to encapsidate the EAMs. 
the loxP sequences are incorporated into a helper virus that has a genome approaching the 
maximal packaging size for Ad [-105 map units: Bett ei al. (1993). supra] which further 
decreases the efficiency of helper virus packaging. Final virus size can be adjusted by the 

10 choice of introns inserted into the reporter gene, by the choice of reporter genes, or by 
including a variety of DNA fragments of various sizes to act as "sniffer" fragments. A 
convenient reporter gene is the alkaline phosphatase gene (see Ex. 6). The optimized Cre- 
loxP system is also incorporated into a helper viral backbone containing disruptions of any or 
all of the E1-E4 genes. Use of such deleted genomes requires viral growth on appropriate 

15 complementing cell lines, such as 293 cells expressing E2. and/or E4 gene products. The 
loxP sequences are incorporated into the helper virus by placing the loxP 
sequences on either side of the packaging signals on a shuttle vector. This modified shuttle 
vector is then used to recombine with the Ad DNA derived from a virus containing 
disruptions of any or all of the E1-E4 genes to produce the desired helper virus containing the 

20 packaging signals flanked by loxP sequences. 

pAdBglll. an adenovirus shuttle plasmid containing Ad sequences from 0-1 and 9-16 
map units (mu) was used as the starting material. Synthetic oligonucleotides were used to 
create a polylinker which was inserted at 1 mu within pAdBglll as follows. The Bglll LoxP 
oligo [5--GAAGATCTATAACTTCGTATAATGTATGCTA 

25 TACGAAGTTATTACCGAAGAAATGGCTCGAGATCTTC-3- (SEQ ID NO:12) and its 
reverese complement [5"-GAAGATCTCGAGCCATTTCTTCGGTAATAACTTCGT 
ATAGCATACATTATACGAAGTTATAGATCTTC-3' (SEQ ID NO: 13)] were synthesized. 
The Afllll LoxP oligo [5'-CCACATGTATAACTTCGTATAGCATACA 
TTATACG A AGTTATAC ATGTGG-3 T (SEQ ID NO: 14)] and its reverse complement [5"- 

30 CCACATGTATAACTTCGTATAATGTATGCTATACGAAGTTATACATG TGG-3 1 (SEQ 
ID NO: 15)] were synthesized. The double stranded form of each loxP oligonuceotide was 
digested with the appropriate restriction enzyme (e.g., Bglll or Afllll) and inserted into 
pAdBglll which had been digested with Bgltt and Afllll This resulted in the insertion of 
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LoxP sequences into the shuttle vector flanking the packaging signals that are located between 
0.5 and 1 mu of Ad5 (one mu equals 360 bp in the Ad5 genome: the sequence of the Ad5 
genome is Isiied in SEQ ID NO:l). The 3" loxP sequence was inserted into the Bgto site 
within pAdBglll. The y loxP sequence was inserted into the .4/7111 site located at base 143 
(-0.8 mu) (numbering relative to Ad5). 

DNA sequencing was used to verify the final structure of the modified shuttle plasmid 
between 0-1 mu. If O e recombinase-mediated excision of the packaging signals is found to 
be too efficient (as judged by the production of too little helper virus), alternate sites for the 
insertion of the loxP sequences are used that would result in deletion of 2. 3 or 4 packaging 
sequences rather than all 5 [Grable and Hearing (1990) J. Virol. 64:2047]. The insertion of 
loxP sequences at sites along the Ad genome contained within the shuttle vector which would 
result in the deletion 2. 3 or 4 packaging sequences are easily made using the technique of 
recombinant PCR [Higuchi (1990) In: PCR Protocols: A Guide ,o Methods and 
Applications. Innis ei al. (eds.) Academic Press. San Diego. CA, pp. 177-183]. The optimal 
amount of Cre recombinase-mediated excision of the packaging signals is that amount which 
permits the production of enough packaged helper virus to permit the slow spread of virus on 
the first plate of cells co-transfected with helper virus DNA (containing the loxP sequences) 
and the EAM vector. This permits serial passage of the EAM preparation onto a subsequent 
lawn of cells to increase the titer of the EAM preparation. Alternatively, if Oe recombinase- 
mediated excision of the packaging signals is essentially 100% efficient in all Cre-expressing 
cells lines (i.e.. regardless of the level of Oe-expression. that is even a cell line expressing a 
low level of Cre as judged by the GFP/fJ-gal assay described above), the packaged EAMs 
may be used along with helper virus (used at a MOI of -1.0) to infect the second or 
subsequent lawn of cells to permit serial passaging to increase the titer of the EAM 
preparation. 

Following introduction of the loxP sequences into the shuttle vector, the human 
placental alkaline phosphatase (HpAp) cDNA under control of the RSV promoter was inserted 
into the polylinker to provide a reporter gene for the helper virus. This is the same reporter 
used previously during EAM generation (Ex. 6). The HpAp sequences were inserted as 
follows. The W-containing shuttle vector was linearized with Xho\ and the hpAp cassette 
was ligated into the Xho\ site (a Xho\ site was inserted into pAdBglll during the insertion of 
the loxP sequences as a Xhol site was located on the 3" end of the loxP sequences inserted 
into the Balll site of pAdBglll). The HpAp-cassette was constructed as follows. 
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pRSVhAPT40 (obiained from Gary Nabel. Univ. of Michigan. Ann Arbor. MI) was digested 
with EcoR] 10 generate an £eoRJ fragment containing the HpAP cDNA and the SV40 intron 
and polyadcnylaiion sequences. pRc/RSV (Invitrogen) was digested with HindlU. then 
partially digested with £coRJ. A 5.208 bp fragment was then size selected on an agarose gel. 
treated with calf alkaline phosphatase, and ligated to the EcoRl fragment derived from 
pRSVhAPT40 to generate pRc/RSVAP. pRc/RSVAP was then digested with Sail and Xhol to 
liberate the RSV promoter linked to the HpAP cDNA cassette (including the SV40 intron and 
polyadenylation sequences). This SalhXhol fragment was inserted into the /caP-containing 
shuttle vector which had been digested with Sal] and Xlw\ to generate pADLoxP-RSVAP. 

Helper virus containing loxP sites flanking the Ad packaging signals is generated by 
co-transfection of the LoxP shuttle plasmid (pADLoxP-RSVAP) and C/d-digested 
Ad5J/7001 DNA into 293 cells [Graham and Prevec (1991) Manipulation of Adenovirus 
Vectors. In Methods in Molecular Biology, Vol. 7: Gene Transfer and Expression Protocols. 
Murray (ed.). Humana Press Inc.. Clifton. NJ, pp. 109-128]. Figure 19 provides a schematic 
15 showing the recombination event between the loxP shuttle vector and the Ad5J/7001 genome. 
Co-transfection is carried out as described in Example 3c. 

Alternatively, the reporter gene may be inserted into the E3 region of pBHGlO or 
pBHGl 1 (using the unique Pad site) rather than into the polylinker located in the El region 
of the shuttle vector. The reporter gene-containing pBHGlO or 1 1 is then used in place of 
20 Ad5<//7001 for cotransfection of 293 cells along with the loxP-containing pAdBglll. 

Following cotransfection. recombinant plaques are picked, plaque purified, and tested 
for incorporation of both hpAp and loxP sequences by PCR and Southern analysis (Ex. 3). 
Viruses which contain loxP sites flanking the packaging signals and the marker gene (hpAp) 
are retained, propagated and purified. 
25 Tlie isolated helper virus containing loxP sites flanking the packaging signals and the 

marker gene is then used to infect both 293 cells and the 293 cell lines that express Cre 
(section a. above). Cre recombinase-expressing cell lines that produce optimal levels of Cre 
recombinase when infected with the /ar/ J -containing helper virus are then used for the 
generation of EAMs as described below. 



30 
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c) Generation Of Encapsidated Adenovirus Minichromosomes 
That Express Dystrophin 

The growth of EAMs is optimized using the following methods. In the first method, 
plasmid DNA from the EAM vector (Ex. 6) are co-transfected into 293 cells with purified 
viral DNA from the helper virus, and viruses are harvested 10-14 days later after appearance 
of a viral cytopathic effect (CPE) as described in Example 6. This approach is simple, yet 
potentially increases helper virus levels by allowing viral spread throughout the culture dishes. 

In the second method, the co-transfected 293 cells are overlaid with agar and single 
plaques are picked and tested for EAM activity by the ability to express P-galactosidase 
activity following infection of Hela cells (Ex. 6). This second approach is more time 
consuming, but will result in less contamination by helper. 

The ability of these two methods to generate EAMs are directly compared. The 
preferred method is that which produces the highest titer of EAM with the lowest 
contamination of helper virus. The efficiency of EAM generation may also be increased by 
using helper virus DNA that retains the terminal protein (TP) {i.e.. the helper virus is used as 
a viral DNA-TPC as described in Ex. 3) The use of viral DNA-TPC has been shown to 
increase the efficiency of viral production following transfection of 293 cells by an order of 
magnitude [Miyake et al. (1996). supra]. 

The initial transfection will utilize 293 cells that do not express Cre recombinase. so 
that efficient spread of the helper can lead to large scale production of EAM. The method 
producing the highest ratio of EAM to helper virus is then employed to optimize conditions 
for the serial propagation of the EAM on 293 cells expressing Cre recombinase. This 
optimization is conducted using cell lines expressing different levels of Cre recombinase. The 
following variables are tested: 1) the ratio of input viral titer to cell number. 2) the number 
of serial passages to use for EAM generation, 3) the use of cell lines producing different 
levels of Cre to achieve the optimal ratio between high EAM titer and low helper titer. 4) 
continuous growth on Off-producing 293 cells versus alternating between G-e-producing cells 
and the parental 293 cells, or to alternate between high and low Oe-producing cells and 5) 
CsCl purification of EAMs prior to re-infection of 293 cells increases the ratio of the final 
EAM/helper titers. The protocols that result in the highest yield of EAM with minimal helper 
virus are used to generate large volumes of crude viral lysates for purification by density 
gradient centrifugation (Ex. 6). 
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EAMs were purified from helper virus using standard CsCI density gradient 
centrifugation protocols in Example 6 and resulted in preparations containing -4% helper virus. 
In order to improve the physical separation of EAMs and helper virus, a variety of different 
centrifugation conditions are possible, including changing the gradient shape, type of rotor and 
tube used, combinations of step and continuous gradients, and the number of gradients used. 
Materials with better resolving powers than CsCI can also be employed. These include 
rubidium chloride and potassium bromide [Reich and Zarybnicky (1979) Annal. Biochem 
94:193]. 

10 d) Use Of EAMs Encoding Dystrophin For Long Term 

Expression Of Dystrophin in Muscle 

To demonstrate the ability of the purified dystrophin expressing EAMs (prepared as 
described above) to deliver and express dystrophin in the muscle of an animal, the following 
experiment is conducted. First, purified dystrophin-EAMs are delivered to the muscle by 

15 direct intramuscular injection into newborn. 1 month, and adult (3 month) mouse quadriceps 
muscle. The dystrophin expressing EAM AdSPDys are injected into mdx mice and into 
transgenic mdx mice that express p-galactosidase in the pituitary gland [Tripathy et al (1996) 
Nature Med. 2:545). These latter mice are used to avoid potential immune-rejection of cells 
expressing P-galactosidase from the Ad5pDys vector. An alternate EAM lacking the P- 

20 galactosidase reporter gene may also be employed; however, the presence of the p-Gal 

reporter simplifies EAM growth and purification (vectors lacking P-Gal have their puritv 
estimated by PCR assays rather than by p-Gal assays). 

Following intramuscular injection of EAM. animals are sacrificed at intervals between 
1 week and 6 months to measure dystrophin expression [by western blot analysis and bv 

25 immunofluorescence (Phelps et al (1995) Hum. Mol. Genet. 4:1251 and Rafael et al (1996) 
J. Cell Biol. 134:93] and muscle extracts will also be assayed for P-Gal activity [MacGregor 
et al (1991). supra]. These results are compared with previous results obtained using current 
generation viral vectors (i.e.. containing deletions in El and E3 only) to demonstrate that 
EAMs improve the prospects for long term gene expression in muscle. 

30 
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e) Use Of Dystrophin-EAMs To Prevent, Hall, Or Reverse The 
Dystrophic Symptoms That Develop In The Muscles Of mdx 
Mice 

To demonstrate that beneficial effects on dystrophic muscle are achieved by deliver) 
of dystrophin expressing EAMs to mdx mice, the following experiments are conducted. 
Central nuclei counts are performed on soleus muscle at intervals following injection of EAM 
into newborn. 1 month, and 3 month (adult) mice (Phelps et ai, 1995). Central nuclei arise 
in mouse muscle only after a myofiber has undergone dystrophic necrosis followed by 
regeneration, and is a quantitative measure of the degree of dystrophy that has occurred in a 
muscle group (Phelps et ai. 1995). 

More informative assays are also contemplated. These assays require administration of 
EAMs to mouse diaphragm muscles. 

The diaphragm is severely affected in mdx mice (Stedman et ai. 1991: Cox et ai. 
1993). and displays dramatic decreases in both force and power generation. Administration of 
virus to the diaphragm will allow the strength of the muscles to be measured at intervals 
following dystrophin delivery. The force and power generating assays developed by the 
Faulkner lab are used to measure the effect of dystrophin transgenes (Shrager et ai. 1992: 
Rafael et ai. 1994: Lynch et ai. 1996). 

To detemine whether dystrophin delivery to dystrophic muscle reverses dystrophy or 
stabilizes muscle, varying amounts of the dystrophin EAM are delivered to the to diaphragm 
at different stages of the dystrophic process and then strength is measured at intervals 
following EAM administration. First, various titers of EAM are tested to determine the 
minimal amount of virus needed to transduce the majority of muscle fibers in the diaphragm. 
It has been shown that conventional adenovirus vectors can transduce the majority of 
diaphragm fibers when 10 8 pfu are administrated by direct injection into the intraperitoneal 
cavity [Huard ei ai (1995) Gene Therapy 2:107]. In addition, it has been shown that 
transduction of a simple majority of fibers in a muscle group is sufficient to prevent virtually 
all the dystrophic symptoms in mice [Rafael et ai (1994) Hum. Mol. Genet. 3:1725 and 
Phelps et ai (1995) Hum. Mol. Genet. 4:1251]. Virus is administered to max animals at three 
different ages (neonatal. 1 month, and 3 months). Animals are sacrificed for physiological 
analysis of diaphragm muscle at two different times post infection ( 1 month and 3 months). 
Error control is achieved by performing these experiments in sextuplicate. Control animals 
consist of mock injected wild-type (C57B1/10) and dystrophic (mdx) mice. Three month old 



- 65 - 



WO 98/17783 PCT/US97/19541 

mdx mice display a 40% reduciion in force and power generation compared with wild-type 
mice, while 6 month animals display a greater than 50% reduciion (Cox ei ai (1993) Nature 
364:725: Rafael ei ai (1994). supra: Phelps ei ai (1995). supra: and Corrado et ai (1996) J. 
Cell. Biol. 134:873]. 



EXAMPLE 8 

Improved Shuttle Vectors For The 
Production Of Helper Virus Containing LoxP Sites 



In the previous example, a shuttle vector containing adenoviral sequences extending 
from 9 mu to 16 mu of the Ad5 genome was modified to contain loxP sequences surrounding 
the packaging signals (the LoxP shuttle vector). This modified shuttle vector was then 
recombined with an Ad virus to produce a helper virus containing the loxP sequences. This 
helper virus was then used to infect cells expressing Cre recombinase along with DNA 
comprising a minichromosome containing the dystrophin gene, a reporter gene and the 
packaging signals and ITRs of Ad in order to preferentially package the minichromosomes. 
Using this approach the helper virus, which has had the packaging signals removed by Cre- 
loxP recombination, contains the majority of the Ad genome (only a portion of the E3 region 
is deleted). Thus, if low levels of helper virus are packaged and appear in the EAM 
preparation, the EAM preparation has the potential of passing on helper virus capable of 
directing the expression of Ad proteins in cells which are exposed to the EAM preparation. 
The expression of Ad proteins may lead to an immune response directed against the infected 
cells. 

Another approach to reducing the possibility that the EAM preparation contains helper 
virus capable of provoking an immune response is to use helper viruses containing deletions 
and/or mutations within the pol and pTP genes. Helper virus containing a deletion in the pol 
and/or pTP genes is cotransfected with the EAM construct into 293-derived cell lines 
expressing pol or pol and pTP to produce EAMs. Any helper virus present in the purified 
EAM preparation will be replication defective due to the deletion in the pol and/or pTP genes. 
As shown in Example 3, viruses containing a deletion in the pol gene are incapable of 
directing the expression of viral late genes: therefore, helper viruses containing a deletion in 
the pol gene or the pol and pTP genes should not be capable of provoking an immune 
response (/.c. a CTL response) against late viral proteins synthesized de novo. Shuttle 
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eetors contain deletions wi , hin , he M p0| ^ pJp ^ ^ ^ ^ 

plnw 



below 

a) Construction Of A Shuttle Vector Containing The Apol 
Deletion 



pAdBglll was modified , 0 comain S£(luences correspon(Jing lo , m 4() mu of 
? enome as fo„ow, pAdBg.,1 was di g esIed wilh ^ and , ^ ^ 

■s..e was added to create pAdBglllAse. pAdBglllAse was then digested with &„ , 071 
AdMfTOOl viral DNA was d.gested wi,„ a* and .he ends were tilled in using T4 DNA 
polymerase. The ^-digested. T4 DNA polymerase filled Ad5„/700, vi, a | DNA was then 
digested wi,h 1 07! and ,he -,, kb Ascl . Bsl] lmmM) ^ ^ ^ 

PTP genes was isolated ,as described in Ex. 3) and ligated ,o the Bs,l 1071-digested 
PAdBglllAse ,o generate pAdAsc. pAdAsc is a shuttle vector which contains ,he genes 
encode ,he DNA polymerase and preterminal protein „he inserted ,.«l- to , ,071 fragmen, 
corresponds lo nucleotides 5767 to 15. 671 of the Ad5 genome). 

A shuttle vector. pAdAscApol. which contains the 612 bp delehon in the p„, gene 
(desenbed ,n Ex. 3) was constructed as follows. Ad5A P olAE3l viral DNA (Ex 3) was 
dtgested with M and the ends were filled in using T4 DNA polymerase. The ^.-digested 
T4 DNA polymerase filled Ad5ApolAE3I viral DNA was then digested wi,h B,,l 1071 and the 
-0.3 kb A*,.*,t ,071 ( f,Hed) fragment containing ,he dele.ed pol gene and ,„e pTP gene was 
■so.ated ,as described in Ex. 3) and ligated to Bs,l ,071-digested pAdBglllAse ,„ generate 
pAdAscApol. 

b) Construction Of A Shuttle Vector Containing The ApolApTP 
Deletion 

A shuttle vecor. pAdAscApolApTP. which contains a 2.3 kb deletion within the pol 
and pTP genes (described in Ex. 5) was constructed as follows. 

P AXBApo.ApTPVARNA+t.3 (Ex. 5b) was digested with Axl and the ends were filled in 
using T4 DNA polymerase. The ^rl-digested. T4 DNA polymerase filled 
P AXBAp 0 lApTPVARNA + tl3 DNA was then digested with Bsil 1071 and the ~7 6 kb Ascl- 
Bsa .(^filled) fragment containing the deleted pol gene and the pTP gene was isolated (as 
desenbed in Ex. 3) and ligated to Bst\ 1071-digested pAdBglllAse to generate 
pAdAscApolApTP. 

In order to reduce the packaging of the above helper viruses, the pol" or pol" pTP' 
helper viruses can be modified ,„ income loxP sequences on either side of the packaging 
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signals as outlined in Example 7. The /oxP-containing pol" or pol". pTP' shunle vectors 
(pTP- shuttle vectors may also be employed) are cotranslected into 293 cells expressing Cre 
recombinase and pol or Cre recombinase. pol and pTP. respectively along with an appropriate 
El " viral DNA-TPC (the El- viral DNA may also contain deletions elsewhere in the genome 
5 such as in the E4 genes or in the E2a gene as packaging cell lines expressing the E4 ORF 6 
or 6/7 and lines expressing E2a genes are avaialble) to generate helper virus containing loxP 
sites flanking the packaging signals as well as a deletion in the pol gene or the pol and pTP 
genes. The resulting helper virus(es) is used to cotransfect 293 cells expressing pol or pol 
and pTP along with the desired EAM construct. The resulting EAM preparation should 

10 contain little if any helper virus and any contaminating helper virus present would be 

replication defective and incapable of expressing viral late gene products. Helper viruses 
containing loxP sequences and deletions in all essential early genes may be emploved in 
conjunction with Cre recombinanse-expressing cell lines expressing in trans the El. E4 ORF 
6. E2a. and E2b (e.g.. Ad polymerase and pTP) proteins (the E3 proteins are dispensible for 

15 growth in culture). Cell lines coexpressing El. polymerase and pTP are provided herein. 

Cell lines expressing El and E4 proteins have been recently described [Krougiiak and Graham 
(1995) Hum. Gene Ther. 6:1575 and Wang et ai (1995) Gene Ther. 2:775] and cell lines 
expressing El and E2a proteins have been recently described [Zhou et ai (1996) J. Virol. 
70:7030]. Therefore, a cell line co-expressing El. E2a, E2b. and E4 is consructed by 

20 introduction of expression plasmids containing the E2a and E4 coding regions into the E1-. 

Ad polymerase- and pTP-expressing cell lines of the present invention. These packaging cell 
lines are used in conjunction with helper viruses containing deletions in the El. E2a. E2b and 
E4 regions. 

All publications and patents mentioned in the above specification are herein 
25 incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
30 the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Chamberlain, Jeffrey S. 

Amalfitano, Andrea 
Hauser, Michael A. 
Kumar-Singh, Rajendra 
Hartigan-O' Connor , Dennis J . 

(ii) TITLE OF INVENTION: IMPROVED ADENOVIRUS VECTORS 

(iii) NUMBER OF SEQUENCES : 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Medlen & Carroll, LLP 

(B) STREET: 220 Montaomery Street, Suite 2200 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: United States Of America 

(F) ZIP: 94104 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version Sl.30 

(vi> CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Ingolia, Diane E . 

(B) REGISTRATION NUMBER: 40,027 

(C) REFERENCE /DOCKET NUMBER ': UM- 024 84 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 705-8410 

(B) TELEFAX: (415) 397-8336 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 593 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



CATCATCAAT 


AATATACCTT 


ATTTTGGATT 


GAAGCCAATA 


TGATAATGAG 


GGGGTGGAGT 


60 


TTGTGACGTG 


GCGCGGGGCG 


TGGGAACGGG 


GCGGGTGACG 


TAGTAGTGTG 


GCGGAAGTGT 


120 


GATGTTGCAA 


GTGTGGCGGA 


ACACATGTAA 


GCGACGGATG 


TGGCAAAAGT 


GACGTTTTTG 


180 


GTGTGCGCCG 


GTGTACACAG 


GAAGTGACAA 


TTTTCGCGCG 


GTTTTAGGCG 


GATGTTGTAG 


240 


TAAATTTGGG 


CGTAACCGAG 


TAAGATTTGG 


CCATTTTCGC 


GGGAAAACTG 


AATAAGAGGA 


300 


AGTGAAATCT 


GAATAATTTT 


GTGTTACTCA 


TAGCGCGTAA 


TATTTGTCTA 


GGGCCGCGGG 


360 


GACTTTGACC 


GTTTACGTGG 


AGACTCGCCC 


AGGTGTTTTT 


CTCAGGTGTT 


TTCCGCGTTC 


420 


CGGGTCAAAG 


TTGGCGTTTT 


ATTATTATAG 


TCAGCTGACG 


TGTAGTGTAT 


TTATACCCGG 


480 
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TGAGTTCCTC 


AAGAGGCCAC 


TCTTGAGTGC 


CAGCGAGTAG 


AGTTTTCTCC 


TCCGAGCCGC 


540 


TCCGACACCG 


GGACTGAAAA 


TGAGACATAT 


TATCTGCCAC 


GGAGGTGTTA 


TTACCGAAGA 


600 


AATGGCCGCC 


AGTCTTTTGG 


ACCAGCTGAT 


CGAAGAGGTA 


CTGGCTGATA 


ATCTTCCACC 


660 


TCCTAGCCAT 


TTTGAACCAC 


CTACCCTTCA 


CGAACTGTAT 


GATTTAGACG 


TGACGGCCCC 


720 


CGAAGATCCC 


AACGAGGAGG 


CGGTTTCGCA 


GATTTTTCCC 


GACTCTGTAA 


TGTTGGCGGT 


780 


GCAGGAAGGG 


ATTGACTTAC 


TCACTTTTCC 


GCCGGCGCCC 


GGTTCTCCGG 


AGCCGCCTCA 


840 


CCTTTCCCGG 


CAGCCCGAGC 


AGCCGGAGCA 


GAGAGCCTTG 


GGTCCGGTTT 


CTATGCCAAA 


900 


CCTTGTACCG 


GAGGTGATCG 


ATCTTACCTG 


CCACGAGGCT 


GGCTTTCCAC 


CCAGTGACGA 


960 


CGAGGATGAA 


GAGGGTGAGG 


AGTTTGTGTT 


AGATTATGTG 


GAGCACCCCG 


GGCACGGTTG 


1020 


CAGGTCTTGT 


CATTATCACC 


GGAGGAATAC 


GGGGGACCCA 


GATATTATGT 


GTTCGCTTTG 


1080 


CTATATGAGG 


ACCTGTGGCA 


TGTTTGTCTA 


CAGTAAGTGA 


AAATTATGGG 


CAGTGGGTGA 


1140 


TAGAGTGGTG 


GGTTTGGTGT 


GGTAATTTTT 


TTTTTAATTT 


TTACAGTTTT 


GTGGTTTAAA 


1200 


GAATTTTGTA 


TTGTGATTTT 


TTTAAAAGGT 


CCTGTGTCTG 


AACCTGAGCC 


TGAGCCCGAG 


1260 


CCAGAACCGG 


AGCCTGCAAG 


ACCTACCCGC 


CGTCCTAAAA 


TGGCGCCTGC 


TATCCTGAGA 


1320 


CGCCCGACAT 


CACCTGTGTC 


TAGAGAATGC 


AATAGTAGTA 


CGGATAGCTG 


TGACTCCGGT 


1380 


CCTTCTAACA 


CACCTCCTGA 


GATACACCCG 


GTGGTCCCGC 


TGTGCCCCAT 


TAAACCAGTT 


1440 


GCCGTGAGAG 


TTGGTGGGCG 


TCGCCAGGCT 


GTGGAATGTA 


TCGAGGACTT 


GCTTAACGAG 


1500 


CCTGGGCAAC 


CTTTGGACTT 


GAGCTGTAAA 


CGCCCCAGGC 


CATAAGGTGT 


AAACCTGTGA 


1560 


TTGCGTGTGT 


GGTTAACGCC 


TTTGTTTGCT 


GAATGAGTTG 


ATGTAAGTTT 


AATAAAGGGT 


1620 


GAGATAATGT 


TTAACTTGCA 


TGGCGTGTTA 


AATGGGGCGG 


GGCTTAAAGG 


GTATATAATG 


1680 


CGCCGTGGGC 


TAATCTTGGT 


TACATCTGAC 


CTCATGGAGG 


CTTGGGAGTG 


TTTGGAAGAT 


1740 


TTTTCTGCTG 


TGCGTAACTT 


GCTGGAACAG 


AGCTCTAACA 


GTACCTCTTG 


GTTTTGGAGG 


1800 


TTTCTGTGGG 


GCTCATCCCA 


GG CAAAGTTA 


GTCTGCAGAA 


TTAAGGAGGA 


TTACAAGTGG 


1860 


GAATTTGAAG 


AGCTTTTGAA 


ATCCTGTGGT 


GAGCTGTTTG 


ATTCTTTGAA 


TCTGGGTCAC 


1920 


CAGGCGCTTT 


TCCAAGAGAA 


GGTCATCAAG 


ACTTTGGATT 


TTTCCACACC 


GGGGCGCGCT 


1980 


GCGGCTGCTG 


TTGCTTTTTT 


GAGTTTTATA 


AAGGATAAAT 


GGAGCGAAGA 


AACCCATCTG 


2040 


AGCGGGGGGT 


ACCTGCTGGA 


TTTTCTGGCC 


ATGCATCTGT 


GGAGAGCGGT 


TGTGAGACAC 


2100 


AAGAATCGCC 


TGCTACTGTT 


GTCTTCCGTC 


CGCCCGGCGA 


TAATACCGAC 


GGAGGAGCAG 


2160 


CAGCAGCAGC 


AGGAGGAAGC 


CAGGCGGCGG 


CGGCAGGAGC 


AGAGCCCATG 


GAACCCGAGA 


2220 


GCCGGCCTGG 


ACCCTCGGGA 


ATGAATGTTG 


TACAGGTGGC 


TGAACTGTAT 


CCAGAACTGA 


2280 


GACGCATTTT 


GACAATTACA 


GAGGATGGGC 


AGGGGCTAAA 


GGGGGTAAAG 


AGGGAGCGGG 


2340 


GGGCTTGTGA 


GGCTACAGAG 


GAGGCTAGGA 


ATCTAGCTTT 


TAGCTTAATG 


ACCAGACACC 


2400 


GTCCTGAGTG 


TATTACTTTT 


CAACAGATCA 


AGGATAATTG 


CGCTAATGAG 


CTTGATCTGC 


2460 


TGGCGCAGAA 


GTATTCCATA 


GAGCAGCTGA 


CCACTTACTG 


GCTGCAGCCA 


GGGGATGATT 


2520 


TTGAGGAGGC 


TATTAGGGTA 


TATGCAAAGG 


TGGCACTTAG 


GCCAGATTGC 


AAGTACAAGA 


2580 
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TCAGCAAACT TGTAAATATC AGGAATTGTT GCTACATTTC TGGGAACGGG GCCGAGGTGG 



AGATAGATAC GGAGGATAGG GTGGCCTTTA GATGTAGCAT GATAAATATG TGGCCGGGGG 
TGCTTGGCAT GGACGGGGTG GTTATTATGA ATGTAAGGTT TACTGGCCCC AATTTTAGCG 
GTACGGTTTT CCTGGCCAAT ACCAACCTTA TCCTACACGG TGTAAGCTTC TATGGGTTTA 
ACAATACCTG TGTGGAAGCC TGGACCGATG TAAGGGTTCG GGGCTGTGCC TTTTACTGCT 
GCTGGAAGGG GGTGGTGTGT CGCCCCAAAA GCAGGGCTTC AATTAAGAAA TGCCTCTTTG 
AAAGGTGTAC CTTGGGTATC CTGTCTGAGG GTAACTCCAG GGTGCGCCAC AATGTGGCCT 
CCGACTGTGG TTGCTTCATG CTAGTGAAAA GCGTGGCTGT GATTAAGCAT AACATGGTAT 
GTGGCAACTG CGAGGACAGG GCCTCTCAGA TGCTGACCTG CTCGGACGGC AACTGTCACC 
TGCTGAAGAC CATTCACGTA GCCAGCCACT CTCGCAAGGC CTGGCCAGTG TTTGAGCATA 
ACATACTGAC CCGCTGTTCC TTGCATTTGG GTAACAGGAG GGGGGTGTTC CTACCTTACC 
AATGCAATTT GAGTCACACT AAGATATTGC TTGAGCCCGA GAGCATGTCC AAGGTGAACC 
TGAACGGGGT GTTTGACATG ACCATGAAGA TCTGGAAGGT GCTGAGGTAC GATGAGACCC 
GCACCAGGTG CAGACCCTGC GAGTGTGGCG GTAAACATAT TAGGAACCAG CCTGTGATGC 
TGGATGTGAC CGAGGAGCTG AGGCCCGATC ACTTGGTGCT GGCCTGCACC CGCGCTGAGT 
TTGGCTCTAG CGATGAAGAT ACAGATTGAG GTACTGAAAT GTGTGGGCGT GGCTTAAGGG 
TGGGAAAGAA TATATAAGGT GGGGGTCTTA TGTAGTTTTG TATCTGTTTT GCAGCAGCCG 
CCGCCGCCAT GAGCACCAAC TCGTTTGATG GAAGCATTGT GAGCTCATAT TTGACAACGC 
GCATGCCCCC ATGGGCCGGG GTGCGTCAGA ATGTGATGGG CTCCAGCATT GATGGTCGCC 
CCGTCCTGCC CGCAAACTCT ACTACCTTGA CCTACGAGAC CGTGTCTGGA ACGCCGTTGG 
AGACTGCAGC CTCCGCCGCC GCTTCAGCCG CTGCAGCCAC CGCCCGCGGG ATTGTGACTG 
ACTTTGCTTT CCTGAGCCCG CTTGCAAGCA GTGCAGCTTC CCGTTCATCC GCCCGCGATG 
ACAAGTTGAC GGCTCTTTTG GCACAATTGG ATTCTTTGAC CCGGGAACTT AATGTCGTTT 
CTCAGCAGCT GTTGGATCTG CGCCAGCAGG TTTCTGCCCT GAAGGCTTCC TCCCCTCCCA 
ATGCGGTTTA AAACATAAAT AAAAAACCAG ACTCTGTTTG GATTTGGATC AAGCAAGTGT 
CTTGCTGTCT TTATTTAGGG GTTTTGCGCG CGCGGTAGGC CCGGGACCAG CGGTCTCGGT 
CGTTGAGGGT CCTGTGTATT TTTTCCAGGA CGTGGTAAAG GTGACTCTGG ATGTTCAGAT 
ACATGGGCAT AAGCCCGTCT CTGGGGTGGA GGTAGCACCA CTGCAGAGCT TCATGCTGCG 
GGGTGGTGTT GTAGATGATC CAGTCGTAGC AGGAGCGCTG GGCGTGGTGC CTAAAAATGT 
CTTTCAGTAG CAAGCTGATT GCCAGGGGCA GGCCCTTGGT GTAAGTGTTT ACAAAGCGGT 
TAAGCTGGGA TGGGTGCATA CGTGGGGATA TGAGATGCAT CTTGGACTGT ATTTTTAGGT 
TGGCTATGTT CCCAGCCATA TCCCTCCGGG GATTCATGTT GTGCAGAACC ACCAGCACAG 
TGTATCCGGT GCACTTGGGA AATTTGTCAT GTAGCTTAGA AGGAAATGCG TGGAAGAACT 
TGGAGACGCC CTTGTGACCT CCAAGATTTT CCATGCATTC GTCCATAATG ATGGCAATGG 
GCCCACGGGC GGCGGCCTGG GCGAAGATAT TTCTGGGATC ACTAACGTCA TAGTTGTGTT 



2640 
'2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
438C 
4440 
4500 
4560 
4620 
4680 
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CCAGGATGAG 


ATCGTCATAG 


GCCATTTTTA 


CAAAGCGCGG 


GCGGAGGGTG 


CCAGACTGCG 


4740 


GTATAATGGT 


TCCATCCGGC 


CCAGGGGCGT 


AGTTACCCTC 


ACAGATTTGC 


ATTTCCCACG 


4800 


CTTTGAGTTC 


AGATGGGGGG 


ATCATGTCTA 


CCTGCGGGGC 


GATGAAGAAA 


ACGGTTTCCG 


4860 


GGGTAGGGGA 


GATCAGCTGG 


GAAGAAAGCA 


GGTTCCTGAG 


CAGCTGCGAC 


TTACCGCAGC 


4920 


CGGTGGGCCC 


GTAAATCACA 


CCTATTACCG 


GGTGCAACTG 


GTAGTTAAGA 


GAGCTGCAGC 


4980 


TGCCGTCATC 


CCTGAGCAGG 


GGGGCCACTT 


CGTTAAGCAT 


GTCCCTGACT 


CGCATGTTTT 


5040 


CCCTGACCAA 


ATCCGCCAGA 


AGGCGCTCGC 


CGCCCAGCGA 


TAGCAGTTCT 


TGCAAGGAAG 


5100 


CAAAGTTTTT 


CAACGGTTTG 


AGACCGTCCG 


CCGTAGGCAT 


GCTTTTGAGC 


GTTTGACCAA 


5160 


GCAGTTCCAG 


GCGGTCCCAC 


AGCTCGGTCA 


CCTGCTCTAC 


GGCATCTCGA 


TCCAGCATAT 


5220 


CTCCTCGTTT 


CGCGGGTTGG 


GGCGGCTTTC 


GCTGTACGGC 


AGTAGTCGGT 


GCTCGTCCAG 


5280 


ACGGGCCAGG 


GTCATGTCTT 


TCCACGGGCG 


CAGGGTCCTC 


GTCAGCGTAG 


TCTGGGTCAC 


5340 


GGTGAAGGGG 


TGCGCTCCGG 


GCTGCGCGCT 


GGCCAGGGTG 


CGCTTGAGGC 


TGGTCCTGCT 


5400 


GGTGCTGAAG 


CGCTGCCGGT 


CTTCGCCCTG 


CGCGTCGGCC 


AGGTAGCATT 


TGACCATGGT 


5460 


GTCATAGTCC 


AGCCCCTCCG 


CGGCGTGGCC 


CTTGGCGCGC 


AGCTTGCCCT 


TGGAGGAGGC 


5520 


GCCGCACGAG 


GGGCAGTGCA 


GACTTTTGAG 


GGCGTAGAGC 


TTGGGCGCGA 


GAAATACCGA 


5580 


TTCCGGGGAG 


TAGGCATCCG 


CGCCGCAGGC 


CCCGCAGACG 


GTCTCGCATT 


CCACGAGCCA 


5640 


GGTGAGCTCT 


GGCCGTTCGG 


GGTCAAAAAC 


CAGGTTTCCC 


CCATGCTTTT 


TGATGCGTTT 


5700 


CTTACCTCTG 


GTTTCCATGA 


GCCGGTGTCC 


ACGCTCGGTG 


ACGAAAAGGC 


TGTCCGTGTC 


5760 


CCCGTATACA 


GACTTGAGAG 


GCCTGTCCTC 


GAGCGGTGTT 


CCGCGGTCCT 


CCTCGTATAG 


5820 


AAACTCGGAC 


CACTCTGAGA 


CAAAGGCTCG 


CGTCCAGGCC 


AGCACGAAGG 


AGGCTAAGTG 


5880 


GGAGGGGTAG 


CGGTCGTTGT 


CCACTAGGGG 


GTCCACTCGC 


TCCAGGGTGT 


GAAGACACAT 


5940 


GTCGCCCTCT 


TCGGCATCAA 


GGAAGGTGAT 


TGGTTTGTAG 


GTGTAGGCCA 


CGTGACCGGG 


6000 


TGTTCCTGAA 


GGGGGGCTAT 


AAAAGGGGGT 


GGGGGCGCGT 


TCGTCCTCAC 


TCTCTTCCGC 


6060 


ATCGCTGTCT 


GCGAGGGCCA 


GCTGTTGGGG 


TGAGTACTCC 


CTCTGAAAAG 


CGGGCATGAC 


6120 


TTCTGCGCTA 


AGATTGTCAG 


TTTCCAAAAA 


CGAGGAGGAT 


TTGATATTCA 


CCTGGCCCGC 


6180 


GGTGATGCCT 


TTGAGGGTGG 


CCGCATCCAT 


CTGGTCAGAA 


AAGACAATCT 


TTTTGTTGTC 


6240 


AAGCTTGGTG 


GCAAACGACC 


CGTAGAGGGC 


GTTGGACAGC 


AACTTGGCGA 


TGGAGCGCAG 


6300 


GGTTTGGTTT 


TTGTCGCGAT 


CGGCGCGCTC 


CTTGGCCGCG 


ATGTTTAGCT 


GCACGTATTC 


6360 


GCGCGCAACG 


CACCGCCATT 


CGGGAAAGAC 


GGTGGTGCGC 


TCGTCGGGCA 


CCAGGTGCAC 


6420 


GCGCCAACCG 


CGGTTGTGCA 


GGGTGACAAG 


GTCAACGCTG 


GTGGCTACCT 


CTCCGCGTAG 


6480 


GCGCTCGTTG 


GTCCAGCAGA 


GGCGGCCGCC 


CTTGCGCGAG 


CAGAATGGCG 


GTAGGGGGTC 


6540 


TAGCTGCGTC 


TCGTCCGGGG 


GGTCTGCGTC 


CACGGTAAAG 


ACCCCGGGCA 


GCAGGCGCGC 


6600 


GTCGAAGTAG 


TCTATCTTGC 


ATCCTTGCAA 


GTCTAGCGCC 


TGCTGCCATG 


CGCGGGCGGC 


6660 


AAGCGCGCGC 


TCGTATGGGT 


TGAGTGGGGG 


ACCCCATGGC 


ATGGGGTGGG 


TGAGCGCGGA 


6720 


GGCGTACATG 


CCGCAAATGT 


CGTAAACGTA 


GAGGGGCTCT 


CTGAGTATTC 


CAAGATATGT 


6780 
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AGGGTAGCAT CTTCCACCGC GGATGCTGGC GCGCACGTAA TCGTATAGTT CGTGCGAGGG 6840 

AGCGAGGAGG TCGGGACCGA GGTTGCTACG GGCGGGCTGC TCTGCTCGGA AGACTATCTG 6900 

CCTGAAGATG GCATGTGAGT TGGATGATAT GGTTGGACGC TGGAAGACGT TGAAGCTGGC 6 96 0 

GTCTG7GAGA CCTACCGCGT CACGCACGAA GGAGGCGTAG GAGTCGCGCA GCTTGTTGAC 702 0 

CAGCTCGGCG GTGACCTGCA CGTCTAGGGC GCAGTAGTCC AGGGTTTCCT TGATGATGTC 7080 

ATACTTATCC TGTCCCTTTT TTTTCCACAG CTCGCGGTTG AGGACAAACT CTTCGCGGTC 7140 

TTTCCAGTAC TCTTGGATCG GAAACCCGTC GGCCTCCGAA CGGTAAGAGC CTAGCATGTA 7200 
GAACTGGTTG ACGGCCTGGT AGGCGCAGCA TCCCTTTTCT ACGGGTAGCG CGTATGCCTG 
CGCGGCCTTC CGGAGCGAGG TGTGGGTGAG CGCAAAGGTG TCCCTGACCA TGACTTTGAG 
GTACTGGTAT TTGAAGTCAG TGTCGTCGCA TCCGCCCTGC TCCCAGAGCA AAAAGTCCGT 
GCGCT7TTTG GAACGCGGAT TTGGCAGGGC GAAGGTGACA TCGTTGAAGA GTATCTTTCC 
CGCGCGAGGC ATAAAGTTGC GTGTGATGCG GAAGGGTCCC GGCACCTCGG AACGGTTGTT 
AATTACCTGG GCGGCGAGCA CGATCTCGTC AAAGCCGTTG ATGTTGTGGC CCACAATGTA 
AAGTTCCAAG AAGCGCGGGA TGCCCTTGAT GGAAGGCAAT TTTTTAAGTT CCTCGTAGGT 
GAGCTCTTCA GGGGAGCTGA GCCCGTGCTC TGAAAGGGCC CAGTCTGCAA GATGAGGGTT 
GGAAGCGACG AATGAGCTCC ACAGGTCACG GGCCATTAGC ATTTGCAGGT GGTCGCGAAA 

GGTCCTAAAC TGGCGACCTA TGGCCATTTT TTCTGGGGTG ATGCAGTAGA AGGTAAGCGG 7800 

GTCTTGTTCC CAGCGGTCCC ATCCAAGGTT CGCGGCTAGG TCTCGCGCGG CAGTCACTAG 7860 

AGGCTCATCT CCGCCGAACT TCATGACCAG CATGAAGGGC ACGAGCTGCT • TCCCAAAGGC 7920 

CCCCATCCAA GTATAGGTCT CTACATCGTA GGTGACAAAG AGACGCTCGG TGCGAGGATG 7 980 

CGAGCCGATC GGGAAGAACT GGATCTCCCG CCACCAATTG GAGGAGTGGC TATTGATGTG 8 04 0 
GTGAAAGTAG AAGTCCCTGC GACGGGCCGA ACACTCGTGC TGGCTTTTGT AAAAACGTGC 
GCAGTACTGG CAGCGGTGCA CGGGCTGTAC ATCCTGCACG AGGTTGACCT GACGACCGCG 
CACAAGGAAG CAGAGTGGGA ATTTGAGCCC CTCGCCTGGC GGGTTTGGCT GGTGGTCTTC 
TACTTCGGCT GCTTGTCCTT GACCGTCTGG CTGCTCGAGG GGAGTTACGG TGGATCGGAC 

CACCACGCCG CGCGAGCCCA AAGTCCAGAT GTCCGCGCGC GGCGGTCGGA GCTTGATGAC 8340 

AACATCGCGC AGATGGGAGC TGTCCATGGT CTGGAGCTCC CGCGGCGTCA GGTCAGGCGG 84 00 

GAGCTCCTGC AGGTTTACCT CGCATAGACG GGTCAGGGCG CGGGCTAGAT CCAGGTGATA 8460 

CCTAATTTCC AGGGGCTGGT TGGTGGCGGC GTCGATGGCT TGCAAGAGGC CGCATCCCCg' 8520 

CGGCGCGACT ACGGTACCGC GCGGCGGGCG GTGGGCCGCG GGGGTGTCCT TGGATGATGC 8580 

ATCTAAAAGC GGTGACGCGG GCGAGCCCCC GGAGGTAGGG GGGGCTCCGG ACCCGCCGGG 8640 

AGAGGGGGCA GGGGCACGTC GGCGCCGCGC GCGGGCAGGA GCTGGTGCTG CGCGCGTAGG 8700 

TTGCTGGCGA ACGCGACGAC GCGGCGGTTG ATCTCCTGAA TCTGGCGCCT CTGCGTGAAG 8760 

ACGACGGGCC CGGTGAGCTT GAGCCTGAAA GAGAGTTCGA CAGAATCAAT TTCGGTGTCG 8820 

TTGACGGCGG CCTGGCGCAA AATCTCCTGC ACGTCTCCTG AGTTGTCTTG ATAGGCGATC 8880 



8100 
8160 
8220 
8280 
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TCGGCCA7GA 


ACTGCTCGAT 


CTCTTCCTCC 


TGGAGATCTC 


CGCGTCCGGC 


TCGCTCCACG 


8940 


GTGGCGGCGA 


GGTCGTTGGA 


AATGCGGGCC 


ATGAGCTGCG 


AGAAGGCGTT 


GAGGCCTCCC 


9000 


TCGTTCCAGA 


CGCGGCTGTA 


GACCACGCCC 


CCTTCGGCAT 


CGCGGGCGCG 


CATGACCACC 


9060 


TGCGCGAGAT 


TGAGCTCCAC 


GTGCCGGGCG 


AAGACGGCGT 


AGTTTCGCAG 


GCGCTGAAAG 


9120 


AGGTAGTTGA 


GGGTGGTGGC 


GGTGTGTTCT 


GCCACGAAGA 


AGTACATAAC 


CCAGCGTCGC 


9180 


AACGTGGATT 


CGTTGATATC 


CCCCAAGGCC 


TCAAGGCGCT 


CCATGGCCTC 


GTAGAAGTCC 


9240 


ACGGCGAAGT 


TGAAAAACTG 


GGAGTTGCGC 


GCCGACACGG 


TTAACTCCTC 


CTCCAGAAGA 


9300 


CGGATGAGCT 


CGGCGACAGT 


GTCGCGCACC 


TCGCGCTCAA 


AGGCTACAGG 


GGCCTCTTCT 


9360 


TCTTCTTCAA 


TCTCCTCTTC 


CATAAGGGCC 


TCCCCTTCTT 


CTTCTTCTGG 


CGGCGGTGGG 


9420 


GGAGGGGGGA 


CACGGCGGCG 


ACGACGGCGC 


ACCGGGAGGC 


GGTCGACAAA 


GCGCTCGATC 


9480 


ATCTCCCCGC 


GGCGACGGCG 


CATGGTCTCG 


GTGACGGCGC 


GGCCGTTCTC 


GCGGGGGCGC 


954C 


AGTTGGAAGA 


CGCCGCCCGT 


CATGTCCCGG 


TTATGGGTTG 


GCGGGGGGCT 


GCCATGCGGC 


9600 


AGGGATACGG 


CGCTAACGAT 


GCATCTCAAC 


AATTGTTGTG 


TAGGTACTCC 


GCCGCCGAGG 


9660 


GACCTGAGCG 


AGTCCGCATC 


GACCGGATCG 


GAAAACCTCT 


CGAGAAAGGC 


GTCTAACCAG 


9720 


TCACAGTCGC 


AAGGTAGGCT 


GAGCACCGTG 


GCGGGCGGCA 


GCGGGCGGCG 


GTCGGGGTTG 


9780 


TTTCTGGCGG 


AGGTGCTGCT 


GATGATGTAA 


TTAAAGTAGG 


CGGTCTTGAG 


ACGGCGGATG 


9840 


GTCGACAGAA 


GCACCATGTC 


CTTGGGTCCG 


GCCTGCTGAA 


TGCGCAGGCG 


GTCGGCCATG 


9900 


CCCCAGGCTT 


CGTTTTGACA 


TCGGCGCAGG 


TCTTTGTAGT 


AGTCTTGCAT 


GAGCCTTTCT 


9960 


ACCGGCACTT 


CTTCTTCTCC 


TTCCTCTTGT 


CCTGCATCTC 


TTGCATCTAT 


CGCTGCGGCG 


10020 


GCGGCGGAGT 


TTGGCCGTAG 


GTGGCGCCCT 


CTTCCTCCCA 


TGCGTGTGAC 


CCCGAAGCCC 


10080 


CTCATCGGCT 


GAAGCAGGGC 


TAGGTCGGCG 


ACAACGCGCT 


CGGCTAATAT 


GGCCTGCTGC 


10140 


ACCTGCGTGA 


GGGTAGACTG 


GAAGTCATCC 


ATGTCCACAA 


AGCGGTGGTA 


TGCGCCCGTG 


10200 


TTGATGGTGT 


AAGTGCAGTT 


GGCCATAACG 


GACCAGTTAA 


CGGTCTGGTG 


ACCCGGCTGC 


10260 


GAGAGCTCGG 


TGTACCTGAG 


ACGCGAGTAA 


GCCGTCGAGT 


CAAATACGTA 


GTCGTTGCAA 


10320 


GTCCGCACCA 


GGTACTGGTA 


TCCCACCAAA 


AAGTGCGGCG 


GCGGCTGGCG 


GTAGAGGGGC 


10380 


CAGCGTAGGG 


TGGCCGGGGC 


TCCGGGGGCG 


AGATCTTCCA 


ACATAAGGCG 


ATGATATCCG 


10440 


TAGATGTACC 


TGGACATCCA 


GGTGATGCCG 


GCGGCGGTGG 


TGGAGGCGCG 


CGGAAAGTCG 


10500 


CGGACGCGGT 


TCCAGATGTT 


GCGCAGCGGC 


AAAAAGTGCT 


CCATGGTCGG 


GACGCTCTGG 


10560 


CCGGTCAGGC 


GCGCGCAATC 


GTTGACGCTC 


TAGACCGTGC 


AAAAGGAGAG 


CCTGTAAGCG 


10620 


GGCACTCTTC 


CGTGGTCTGG 


TGGATAAATT 


CGCAAGGGTA 


TCATGGCGGA 


CGACCGGGGT 


10680 


TCGAGCCCCG 


TATCCGGCCG 


TCCGCCGTGA 


TCCATGCGGT 


TACCGCCCGC 


GTGTCGAACC 


10740 


CAGGTGTGCG 


ACGTCAGACA 


ACGGGGGAGT 


GCTCCTTTTG 


GCTTCCTTCC 


AGGCGCGGCG 


10800 


GCTGCTGCGC 


TAGCTTTTTT 


GGCCACTGGC 


CGCGCGCAGC 


GTAAGCGGTT 


AGGCTGGAAA 


10860 


GCGAAAGCAT 


TAAGTGGCTC 


GCTCCCTGTA 


GCCGGAGGGT 


TATTTTCCAA 


GGGTTGAGTC 


10920 


GCGGGACCCC 


CGGTTCGAGT 


CTCGGACCGG 


CCGGACTGCG 


GCGAACGGGG 


GTTTGCCTCC 


10980 
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CCGTCATGCA AGACCCCGCT TGCAAATTCC TCCGGAAACA GGGACGAGCC CCTTTTTTGC 11040 

TTTTCCCAGA TGCATCCGGT GCTGCGGCAG ATGCGCCCCC CTCCTCAGCA GCGGCAAGAG 11100 

CAAGAGCAGC GGCAGACATG CAGGGCACCC TCCCCTCCTC CTACCGCGTC AGGAGGGGCG 11160 

ACATCCGCGG TTGACGCGGC AGCAGATGGT GATTACGAAC CCCCGCGGCG CCGGGCCCGG 11220 

CACTACCTGG ACTTGGAGGA GGGCGAGGGC CTGGCGCGGC TAGGAGCGCC CTCTCCTGAG 11280 

CGGTACCCAA GGGTGCAGCT GAAGCGTGAT ACGCGTGAGG CGTACGTGCC GCGGCAGAAC 11340 

CTGTTTCGCG ACCGCGAGGG AGAGGAGCCC GAGGAGATGC GGGATCGAAA GTTCCACGCA 11400 

GGGCGCGAGC TGCGGCATGG CCTGAATCGC GAGCGGTTGC TGCGCGAGGA GGACTTTGAG 11460 

CCCGACGCGC GAACCGGGAT TAGTCCCGCG CGCGCACACG TGGCGGCCGC CGACCTGGTA 11520 

ACCGCATACG AGCAGACGGT GAACCAGGAG ATTAACTTTC AAAAAAGCTT TAACAACCAC 11580 

GTGCGTACGC TTGTGGCGCG CGAGGAGGTG GCTATAGGAC TGATGCATCT GTGGGACTTT 11640 

GTAAGCGCGC TGGAGCAAAA CCCAAATAGC AAGCCGCTCA TGGCGCAGCT GTTCCTTATA 11700 

GTGCAGCACA GCAGGGACAA CGAGGCATTC AGGGATGCGC TGCTAAACAT AGTAGAGCCC 11760 

GAGGGCCGCT GGCTGCTCGA TTTGATAAAC ATCCTGCAGA GCATAGTGGT GCAGGAGCGC 11820 

AGCTTGAGCC TGGCTGACAA GGTGGCCGCC ATCAACTATT CCATGCTTAG CCTGGGCAAG 11880 

TTTTACGCCC GCAAGATATA CCATACCCCT TACGTTCCCA TAGACAAGGA GGTAAAGATC 11940 

GAGGGGTTCT ACATGCGCAT GGCGCTGAAG GTGCTTACCT TGAGCGACGA CCTGGGCGTT 12000 

TATCGCAACG AGCGCATCCA CAAGGCCGTG AGCGTGAGCC GGCGGCGCGA GCTCAGCGAC 12060 

CGCGAGCTGA TGCACAGCCT GCAAAGGGCC CTGGCTGGCA CGGGCAGCGG CGATAGAGAG 1212 0 

GCCGAGTCCT ACTTTGACGC GGGCGCTGAC CTGCGCTGGG CCCCAAGCCG ACGCGCCCTG 12180 

GAGGCAGCTG GGGCCGGACC TGGGCTGGCG GTGGCACCCG CGCGCGCTGG CAACGTCGGC 1224 0 

GGCGTGGAGG AATATGACGA GGACGATGAG TACGAGCCAG AGGACGGCGA GTACTAAGCG 12300 

GTGATGTTTC TGATCAGATG ATGCAAGACG CAACGGACCC GGCGGTGCGG GCGGCGCTGC 12360 

AGAGCCAGCC GTCCGGCCTT AACTCCACGG ACGACTGGCG CCAGGTCATG GACCGCATCA 12420 

TGTCGCTGAC TGCGCGCAAT CCTGACGCGT TCCGGCAGCA GCCGCAGGCC AACCGGCTCT 12480 

CCGCAATTCT GGAAGCGGTG GTCCCGGCGC GCGCAAACCC CACGCACGAG AAGGTGCTGG 12540 

CGATCGTAAA CGCGCTGGCC GAAAACAGGG CCATCCGGCC CGACGAGGCC GGCCTGGTCT 12600 

ACGACGCGCT GCTTCAGCGC GTGGCTCGTT ACAACAGCGG CAACGTGCAG ACCAACCTGG 12660 

ACCGGCTGGT GGGGGATGTG CGCGAGGCCG TGGCGCAGCG TGAGCGCGCG CAGCAGCAGG 12720 

GCAACCTGGG CTCCATGGTT GCACTAAACG CCTTCCTGAG TACACAGCCC GCCAACGTGC 12780 

CGCGGGGACA GGAGGACTAC ACCAACTTTG TGAGCGCACT GCGGCTAATG GTGACTGAGA 12840 

CACCGCAAAG TGAGGTGTAC CAGTCTGGGC CAGACTATTT TTTCCAGACC AGTAGACAAG 12900 

GCCTGCAGAC CGTAAACCTG AGCCAGGCTT TCAAAAACTT GCAGGGGCTG TGGGGGGTGC 12960 

GGGCTCCCAC AGGCGACCGC GCGACCGTGT CTAGCTTGCT GACGCCCAAC TCGCGCCTGT 13020 

TGCTGCTGCT AATAGCGCCC TTCACGGACA GTGGCAGCGT GTCCCGGGAC ACATACCTAG 13080 
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GTCACTTGCT 


GACACTGTAC 


CGCGAGGCCA 


TAGGTCAGGC 


GCATGTGGAC 


PCT/US97/19541 

GAGCATACTT 1314 0 


TCCAGGAGAT 


TACAAGTGTC 


AGCCGCGCGC 


TGGGGCAGGA 


GGACACGGGC 


AGCCTGGAGG 


13200 


CAACCCTAAA 


CTACCTGCTG 


ACCAACCGGC 


GGCAGAAGAT 


CCCCTCGTTG 


CACAGTTTAA 


13260 


ACAGCGAGGA 


GGAGCGCATT 


TTGCGCTACG 


TGCAGCAGAG 


CGTGAGCCTT 


AACCTGATGC 


13320 


GCGACGGGGT 


AACGCCCAGC 


GTGGCGCTGG 


ACATGACCGC 


GCGCAACATG 


GAACCGGGCA 


13380 


TGTATGCCTC 


AAACCGGCCG 


TTTATCAACC 


GCCTAATGGA 


CTACTTGCAT CGCGCGGCCG 


13440 


CCGTGAACCC 


CGAGTATTTC 


ACCAATGCCA 


TCTTGAACCC 


GCACTGGCTA 


CCGCCCCCTG 


13500 


GTTTCTACAC 


CGGGGGATTC 


GAGGTGCCCG 


AGGGTAACGA 


TGGATTCCTC 


TGGGACGACA 


13560 


TAGACGACAG 


CGTGTTTTCC 


CCGCAACCGC 


AGACCCTGCT 


AGAGTTGCAA 


CAGCGCGAGC 


13620 


AGGCAGAGGC 


GGCGCTGCGA 


AAGGAAAGCT 


TCCGCAGGCC 


AAGCAGCTTG 


TCCGATCTAG 


13680 


GCGCTGCGGC 


CCCGCGGTCA 


GATGCTAGTA 


GCCCATTTCC 


AAGCTTGATA 


GGGTCTCTTA 


13740 


CCAGCACTCG 


CACCACCCGC 


CCGCGCCTGC 


TGGGCGAGGA 


GGAGTACCTA 


AACAACTCGC 


13800 


TGCTGCAGCC 


GCAGCGCGAA 


AAAAACCTGC 


CTCCGGCATT 


TCCCAACAAC 


GGGATAGAGA 


13860 


GCCTAGTGGA 


CAAGATGAGT 


AGATGGAAGA 


CGTACGCGCA 


GGAGCACAGG 


GACGTGCCAG 


13920 


GCCCGCGCCC 


GCCCACCCGT 


CGTCAAAGGC 


ACGACCGTCA 


GCGGGGTCTG 


GTGTGGGAGG 


13980 


ACGATGACTC 


GGCAGACGAC 


AGCAGCGTCC 


TGGATTTGGG 


AGGGAGTGGC 


AACCCGTTTG 


14040 


CGCACCTTCG 


CCCCAGGCTG 


GGGAGAATGT 


TTTAAAAAAA 


AAAAAGCATG 


ATGCAAAATA 


14100 


AAAAACTCAC 


CAAGGCCATG 


GCACCGAGCG 


TTGGTTTTCT 


TGTATTCCCC 


TTAGTATGCG 


14160 


GCGCGCGGCG 


ATGTATGAGG 


AAGGTCCTCC 


TCCCTCCTAC 


GAGAGTGTGG 


TGAGCGCGGC 


14220 


GCCAGTGGCG 


GCGGCGCTGG 


GTTCTCCCTT 


CGATGCTCCC 


CTGGACCCGC 


CGTTTGTGCC 


14280 


TCCGCGGTAC 


CTGCGGCCTA 


CCGGGGGGAG 


AAACAGCATC 


CGTTACTCTG 


AGTTGGCACC 


14340 


CCTATTCGAC 


ACCACCCGTG 


TGTACCTGGT 


GGACAACAAG 


TCAACGGATG 


TGGCATCCCT 


14400 


GAACTACCAG 


AACGACCACA 


GCAACTTTCT 


GACCACGGTC 


ATTCAAAACA 


ATGACTACAG 


14460 


CCCGGGGGAG 


GCAAGCACAC 


AGACCATCAA 


TCTTGACGAC 


CGGTCGCACT 


GGGGCGGCGA 


14520 


CCTGAAAACC 


ATCCTGCATA 


CCAACATGCC 


AAATGTGAAC 


GAGTTCATGT 


TTACCAATAA 


14580 


GTTTAAGGCG 


CGGGTGATGG 


TGTCGCGCTT 


GCCTACTAAG 


GACAATCAGG 


TGGAGCTGAA 


14640 


ATACGAGTGG 


GTGGAGTTCA 


CGCTGCCCGA 


GGGCAACTAC 


TCCGAGACCA 


TGACCATAGA 


14700 


CCTTATGAAC 


AACGCGATCG 


TGGAGCACTA 


CTTGAAAGTG 


GGCAGACAGA 


ACGGGGTTCT 


14760 


GGAAAGCGAC 


ATCGGGGTAA 


AGTTTGACAC 


CCGCAACTTC 


AGACTGGGGT 


TTGACCCCGT 


14820 


CACTGGTCTT 


GTCATGCCTG 


GGGTATATAC 


AAACGAAGCC 


TTCCATCCAG 


ACATCATTTT 


14880 


GCTGCCAGGA 


TGCGGGGTGG 


ACTTCACCCA 


CAGCCGCCTG 


AGCAACTTGT 


TGGGCATCCG 


14940 


CAAGCGGCAA 


CCCTTCCAGG 


AGGGCTTTAG 


GATCACCTAC 


GATGATCTGG 


AGGGTGGTAA 


15000 


CATTCCCGCA 


CTGTTGGATG 


TGGACGCCTA 


CCAGGCGAGC 


TTGAAAGATG 


ACACCGAACA 


15060 


GGGCGGGGGT 


GGCGCAGGCG 


GCAGCAACAG 


CAGTGGCAGC 


GGCGCGGAAG 


AGAACTCCAA 


15120 


CGCGGCAGCC 


GCGGCAATGC 


AGCCGGTGGA 


GGACATGAAC 


GATCATGCCA 


TTCGCGGCGA 


15180 



- 76 - 



WO 98/17783 





PCT/US97/19541 


GAAGCAGCGG 


CCGAAGCTGC 


1524 0 


AAACCGGTGA 


TCAAACCCCT 


15300 


AATGACAGCA 


CCTTCACCCA 


15360 


CAGACCGGAA 


TCCGCTCATG 


15420 


CAGGTCTACT 


GGTCGTTGCC 


154 80 


CAGATCAGCA 


ACTTTCCGGT 


1554 0 


TACAACGACC 


AGGCCGTCTA 




TTCAATCGCT 




t c a c n 


GTCAGTGAAA 


ACGTTCCTGP 


T C *7 ~) n 


GGAGGAGTCC 


AGCGAGTGAC 


ij 'OU 


AAGGCCCTGG 


GCATAGTCTC 


1J04U 


TCCATCCTTA 


TATCGCCCAG 




TTTGGCGGGG 


CCAAGAAGCG 




GCGCCCTGGG 


GCGCGCACAA 


16020 


GACGCGGTGG 


TGGAGGAGGC 


16080 


GACGCGGCCA 


TTCAGACCGT 


16140 


CGGAGGCGCG 


TAGCACGTCG 


16200 



CCACCGCCGC CGACCCGGCA CTGCCGCCCA ACGCGCGGCG GCGGCCCTGC TTAACCGCGC 
ACGTCGCACC GGCCGACGGG CGGCCATGCG GGCCGCTCGA AGGCTGGCCG CGGGTATTGT 
CACTGTGCCC CCCAGGTCCA GGCGACGAGC GGCCGCCGCA GCAGCCGCGG CCATTAGTGC 
TATGACTCAG GGTCGCAGGG GCAACGTGTA TTGGGTGCGC GACTCGGTTA GCGGCCTGCG 
CGTGCCCGTG CGCACCCGCC CCCCGCGCAA CTAGATTGCA AGAAAAAACT ACTTAGACTC 
GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA GCTATGTCCA AGCGCAAAAT 
CAAAGAAGAG ATGCTCCAGG TCATCGCGCC GGAGATCTAT GGCCCCCCGA AGAAGGAAGA 
GCAGGATTAC AAGCCCCGAA AGCTAAAGCG GGTCAAAAAG AAAAAGAAAG ATGATGATGA 
TGAACTTGAC GACGAGGTGG AACTGCTGCA CGCTACCGCG CCCAGGCGAC GGGTACAGTG 
GAAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT TTACGCCCGG 
TGAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG GTGTACGGCG ACGAGGACCT 
GCTTGAGCAG GCCAACGAGC GCCTCGGGGA GTTTGCCTAC GGAAAGCGGC ATAAGGACAT 
GCTGGCGTTG CCGCTGGACG AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA 
GCAGGTGCTG CCCGCGCTTG CACCGTCCGA AGAAAAGCGC GGCCTAAAGC GCGAGTCTGG 
TGACTTGGCA CCCACCGTGC AGCTGATGGT ACCCAAGCGC CAGCGACTGG AAGATGTCTT 
GGAAAAAATG ACCGTGGAAC CTGGGCTGGA GCCCGAGGTC CGCGTGCGGC CAATCAAGCA 
GGTGGCGCCG GGACTGGGCG TGCAGACCGT GGACGTTCAG ATACCCACTA CCAGTAGCAC 
CAGTATTGCC ACCGCCACAG AGGGCATGGA GACACAAACG TCCCCGGTTG CCTCAGCGGT 



16260 

16320 

16380 

16440 

16500 

16560 

16620 

16680 

16740 

16800 

16860 

16920 

16980 

17040 

17100 

17160 

17220 

17280 
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GGCGGATGCC 


GCGGTGCAGG 


CGGTCGCTGC 


GGCCGCGTCC 


AAGACCTCTA 


CGGAGGTGCA 


17340 


AACGGACCCG 


TGGATGTTTC 


GCGTTTCAGC 


CCCCCGGCGC 


CCGCGCGGTT 


CGAGGAAGTA 


•17400 


CGGCGCCGCC 


AGCGCGCTAC 


TGCCCGAATA 


TGCCCTACAT 


CCTTCCATTG 


CGCCTACCCC 


17460 


CGGCTATCGT 


GGCTACACCT 


ACCGCCCCAG 


AAGACGAGCA 


ACTACCCGAC 


GCCGAACCAC 


17520 


CACTGGAACC 


CGCCGCCGCC 


GTCGCCGTCG 


CCAGCCCGTG 


CTGGCCCCGA 


TTTCCGTGCG 


17580 


CAGGGTGGCT 


CGCGAAGGAG 


GCAGGACCCT 


GGTGCTGCCA 


ACAGCGCGCT 


ACCACCCCAG 


17640 


CATCGTTTAA 


AAGCCGGTCT 


TTGTGGTTCT 


TGCAGATATG 


GCCCTCACCT 


GCCGCCTCCG 


17700 


TTTCCCGGTG 


CCGGGATTCC 


GAGGAAGAAT 


GCACCGTAGG 


AGGGGCATGG 


CCGGCCACGG 


17760 


CCTGACGGGC 


GGCATGCGTC 


GTGCGCACCA 


CCGGCGGCGG 


CGCGCGTCGC 


ACCGTCGCAT 


17820 


GCGCGGCGGT 


ATCCTGCCCC 


TCCTTATTCC 


ACTGATCGCC 


GCGGCGATTG 


GCGCCGTGCC 


17880 


CGGAATTGCA 


TCCGTGGCCT 


TGCAGGCGCA 


GAGACACTGA 


TTAAAAACAA 


GTTGCATGTG 


17940 


GAAAAATCAA 


AATAAAAAGT 


CTGGACTCTC 


ACGCTCGCTT 


GGTCCTGTAA 


CTATTTTGTA 


18000 


GAATGGAAGA 


CATCAACTTT 


GCGTCTCTGG 


CCCCGCGACA 


CGGCTCGCGC 


CCGTTCATGG 


18060 


GAAACTGGCA 


AGATATCGGC 


ACCAGCAATA 


TGAGCGGTGG 


CGCCTTCAGC 


TGGGGCTCGC 


18120 


TGTGGAGCGG 


CATTAAAAAT 


TTCGGTTCCA 


CCGTTAAGAA 


CTATGGCAGC 


AAGGCCTGGA 


18180 


ACAGCAGCAC 


AGGCCAGATG 


CTGAGGGATA 


AGTTGAAAGA 


GCAAAATTTC 


CAACAAAAGG 


18240 


TGGTAGATGG 


CCTGGCCTCT 


GGCATTAGCG 


GGGTGGTGGA 


CCTGGCCAAC 


CAGGCAGTGC 


18300 


AAAATAAGAT 


TAACAGTAAG 


CTTGATCCCC 


GCCCTCCCGT 


AGAGGAGCCT 


CCACCGGCCG 


18360 


TGGAGACAGT 


GTCTCCAGAG 


GGGCGTGGCG 


AAAAGCGTCC 


GCGCCCCGAC 


AGGGAAGAAA 


18420 


CTCTGGTGAC 


GCAAATAGAC 


GAGCCTCCCT 


CGTACGAGGA 


GGCACTAAAG 


CAAGGCCTGC 


18480 


CCACCACCCG 


TCCCATCGCG 


CCCATGGCTA 


CCGGAGTGCT 


GGGCCAGCAC 


ACACCCGTAA 


18540 


CGCTGGACCT 


GCCTCCCCCC 


GCCGACACCC 


AGCAGAAACC 


TGTGCTGCCA 


GGCCCGACCG 


18600 


CCGTTGTTGT 


AACCCGTCCT 


AGCCGCGCGT 


CCCTGCGCCG 


CGCCGCCAGC 


GGTCCGCGAT 


18660 


CGTTGCGGCC 


CGTAGCCAGT 


GGCAACTGGC 


AAAGCACACT 


GAACAGCATC 


GTGGGTCTGG 


18720 


GGGTGCAATC 


CCTGAAGCGC 


CGACGATGCT 


TCTGAATAGC 


TAACGTGTCG 


TATGTGTGTC 


18780 


ATGTATGCGT 


CCATGTCGCC 


GCCAGAGGAG 


CTGCTGAGCC 


GCCGCGCGCC 


CGCTTTCCAA 


18840 


GATGGCTACC 


CCTTCGATGA 


TGCCGCAGTG 


GTCTTACATG 


CACATCTCGG 


GCCAGGACGC 


18900 


CTCGGAGTAC 


CTGAGCCCCG 


GGCTGGTGCA 


GTTTGCCCGC 


GCCACCGAGA 


CGTACTTCAG 


18960 


CCTGAATAAC 


AAGTTTAGAA 


ACCCCACGGT 


GGCGCCTACG 


CACGACGTGA 


CCACAGACCG 


19020 


GTCCCAGCGT 


TTGACGCTGC 


GGTTCATCCC 


TGTGGACCGT 


GAGGATACTG 


CGTACTCGTA 


19080 


CAAGGCGCGG 


TTCACCCTAG 


CTGTGGGTGA 


TAACCGTGTG 


CTGGACATGG 


CTTCCACGTA 


19140 


CTTTGACATC 


CGCGGCGTGC 


TGGACAGGGG 


CCCTACTTTT 


AAGCCCTACT 


CTGGCACTGC 


19200 


CTACAACGCC 


CTGGCTCCCA 


AGGGTGCCCC 


AAATCCTTGC 


GAATGGGATG 


AAGCTGCTAC 


19260 


TGCTC7TGAA 


ATAAACCTAG 


AAGAAGAGGA 


CGATGACAAC 


GAAGACGAAG 


TAGACGAGCA 


19320 


AGCTGAGCAG 


CAAAAAACTC 


ACGTATTTGG 


GCAGGCGCCT 


TATTCTGGTA 


TAAATATTAC 


19380 
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AAATATGCCG 


ATAAAACATT 


1944 0 


ACTGAAATTA 


ATCATGCAGC 


19500 


TACGGTTCAT 


ATGCAAAACC 


19560 


LAAAATGGAA 


AGCTAGAAAG 


19620 


bCAGGCAATG 


GTGATAACTT 


19680 


bAAACCCCAG 


ACACTCATAT 


19740 


CTAATGGGCC 


AACAATCTAT 


19800 


ATTGGTCTAA 


TGTATTACAA 




CAGTTGAATG 


CTGTTGTAGA 


J- 5 j ^ U 


CTTGATTCCA 


TTGGTGATAG 


1 Qoon 


TATGATCCAG 


ATGTTAGAAT 


* U (J 


TGCTTTCCAC 


TGGGAGGTGT 


J.UU 


GGTCAGGAAA 


ATGGATGGGA 


20160 


GTTGGAAATA 


ATTTTGCCAT 


20220 


TACTCCAACA 


TAGCGCTGTA 


20280 


ATTTCTGATA 


ACCCAAACAC 


20340 



CTACGACTAC ATGAACAAGC GAGTGGTGGC TCCCGGGTTA GTGGACTGCT ACATTAACCT 
TGGAGCACGC TGGTCCCTTG ACTATATGGA CAACGTCAAC CCATTTAACC ACCACCGCAA 
TGCTGGCCTG CGCTACCGCT CAATGTTGCT GGGCAATGGT CGCTATGTGC CCTTCCACAT 
CCAGGTGCCT CAGAAGTTCT TTGCCATTAA AAACCTCCTT CTCCTGCCGG GCTCATACAC 
CTACGAGTGG AACTTCAGGA AGGATGTTAA CATGGTTCTG CAGAGCTCCC TAGGAAATGA 
CCTAAGGGTT GACGGAGCCA GCATTAAGTT TGATAGCATT TGCCTTTACG CCACCTTCTT 
CCCCATGGCC CACAACACCG CCTCCACGCT TGAGGCCATG CTTAGAAACG ACACCAACGA 
CCAGTCCTTT AACGACTATC TCTCCGCCGC CAACATGCTC TACCCTATAC CCGCCAACGC 
TACCAACGTG CCCATATCCA TCCCCTCCCG CAACTGGGCG GCTTTCCGCG GCTGGGCCTT 
CACGCGCCTT AAGACTAAGG AAACCCCATC ACTGGGCTCG GGCTACGACC CTTATTACAC 
CTACTCTGGC TCTATACCCT ACCTAGATGG AACCTTTTAC CTCAACCACA CCTTTAAGAA 
GGTGGCCATT ACCTTTGACT CTTCTGTCAG CTGGCCTGGC AATGACCGCC TGCTTACCCC 
CAACGAGTTT GAAATTAAGC GCTCAGTTGA CGGGGAGGGT TACAACGTTG CCCAGTGTAA 
CATGACCAAA GACTGGTTCC TGGTACAAAT GCTAGCTAAC TACAACATTG GCTACCAGGG 
CTTCTATATC CCAGAGAGCT ACAAGGACCG CATGTACTCC TTCTTTAGAA ACTTCCAGCC 
CATGAGCCGT CAGGTGGTGG ATGATACTAA ATACAAGGAC TACCAACAGG TGGGCATCCT 
ACACCAACAC AACAACTCTG GATTTGTTGG CTACCTTGCC CCCACCATGC GCGAAGGACA 
GGCCTACCCT GCTAACTTCC CCTATCCGCT TATAGGCAAG ACCGCAGTTG ACAGCATTAC 
CCAGAAAAAG TTTCTTTGCG ATCGCACCCT TTGGCGCATC CCATTCTCCA GTAACTTTAT 



20400 

20460 

20520 

20560 
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20700 

20760 
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GTCCATGGGC 


GCACTCACAG 


ACCTGGGCCA 


AAACCTTCTC 


TACGCCAACT 


PCT/US97/19541 

CCGCCCACGC 21540 


GCTAGACATG 


ACTTTTGAGG 


TGGATCCCAT 


GGACGAGCCC 


ACCCTTCTTT 


ATGTTTTG7T 


21600 


TGAAGTCTTT 


GACGTGGTCC 


GTGTGCACCG 


GCCGCACCGC 


GGCGTCATCG 


AAACCGTGTA 


21660 


CCTGCGCACG 


CCCTTCTCGG 


CCGGCAACGC 


CACAACATAA 


AGAAGCAAGC 


AACATCAACA 


2172C 


ACAGCTGCCG 


CCATGGGCTC 


CAGTGAGCAG 


GAACTGAAAG 


CCATTGTCAA 


AGATCTTGGT 


21780 


TGTGGGCCAT 


ATTTTTTGGG 


CACCTATGAC 


AAGCGCTTTC 


CAGGCTTTGT 


TTCTCCACAC 


21840 


AAGCTCGCCT 


GCGCCATAGT 


CAATACGGCC 


GGTCGCGAGA 


CTGGGGGCGT 


ACACTGGATG 


21900 


GCCTTTGCCT 


GGAACCCGCA 


CTCAAAAACA 


TGCTACCTCT 


TTGAGCCCTT 


TGGCTTTTCT 


21960 


GACCAGCGAC 


TCAAGCAGGT 


TTACCAGTTT 


GAGTACGAGT 


CACTCCTGCG 


CCGTAGCGCC 


22020 


ATTGCTTCTT 


CCCCCGACCG 


CTGTATAACG 


CTGGAAAAGT 


CCACCCAAAG 


CGTACAGGGG 


22080 


CCCAACTCGG 


CCGCCTGTGG 


ACTATTCTGC 


TGCATGTTTC 


TCCACGCCTT 


TGCCAACTGG 


2214C 


CCCCAAACTC 


CCATGGATCA 


CAACCCCACC 


ATGAACCTTA 


TTACCGGGGT 


ACCCAACTCC 


22200 


ATGCTCAACA 


GTCCCCAGGT 


ACAGCCCACC 


CTGCGTCGCA 


ACCAGGAACA 


GCTCTACAGC 


22260 


TTCCTGGAGC 


GCCACTCGCC 


CTACTTCCGC 


AGCCACAGTG 


CGCAGATTAG 


GAGCGCCACT 


22320 


TCTTTTTGTC 


ACTTGAAAAA 


CATGTAAAAA 


TAATGTACTA 


GAGACACTTT 


CAATAAAGGC 


22380 


AAATGCTTTT 


ATTTGTACAC 


TCTCGGGTGA 


TTATTTACCC 


CCACCCTTGC 


CGTCTGCGCC 


22440 


GTTTAAAAAT 


CAAAGGGGTT 


CTGCCGCGCA 


TCGCTATGCG 


CCACTGGCAG 


GGACACGTTG 


22500 


CGATACTGGT 


GTTTAGTGCT 


CCACTTAAAC 


TCAGGCACAA 


CCATCCGCGG 


CAGCTCGGTG 


2256C 


AAGTTTTCAC 


TCCACAGGCT 


GCGCACCATC 


ACCAACGCGT 


TTAGCAGGTC 


GGGCGCCGAT 


2262C 


ATCTTGAAGT 


CGCAGTTGGG 


GCCTCCGCCC 


TGCGCGCGCG 


AGTTGCGATA 


CACAGGGTTG 


22680 


CAGCACTGGA 


ACACTATCAG 


CGCCGGGTGG 


TGCACGCTGG 


CCAGCACGCT 


CTTGTCGGAG 


22740 


ATCAGATCCG 


CGTCCAGGTC 


CTCCGCGTTG 


CTCAGGGCGA 


ACGGAGTCAA 


CTTTGGTAGC 


22800 


TGCCTTCCCA 


AAAAGGGCGC 


GTGCCCAGGC 


TTTGAGTTGC 


ACTCGCACCG 


TAG TGG CATC 


22860 


AAAAGGTGAC 


CGTGCCCGGT 


CTGGGCGTTA 


GGATACAGCG 


CCTGCATAAA 


AGCCTTGATC 


22920 


TGCTTAAAAG 


CCACCTGAGC 


CTTTGCGCCT 


TCAGAGAAGA 


ACATGCCGCA 


AGACTTGCCG 


22980 


G AAAAC TG AT 


TGGCCGGACA 


GGCCGCGTCG 


TGCACGCAGC 


ACCTTGCGTC 


GGTGTTGGAG 


23040 


ATCTGCACCA 


CATTTCGGCC 


CCACCGGTTC 


TTCACGATCT 


TGGCCTTGCT 


AGACTGCTCC 


23100 


TTCAGCGCGC 


GCTGCCCGTT 


TTCGCTCGTC 


ACATCCATTT 


CAATCACGTG 


CTCCTTATTT 


23160 


ATCATAATGC 


TTCCGTGTAG 


ACACTTAAGC 


TCGCCTTCGA 


TCTCAGCGCA 


GCGGTGCAGC 


23220 


CACAACGCGC 


AGCCCGTGGG 


CTCGTGATGC 


TTGTAGGTCA 


CCTCTGCAAA 


CGACTGCAGG 


23280 


TACGCCTGCA 


GGAATCGCCC 


CATCATCGTC 


ACAAAGGTCT 


TGTTGCTGGT 


GAAGGTCAGC 


23340 


TGCAACCCGC 


GGTGCTCCTC 


GTTCAGCCAG 


GTCTTGCATA 


CGGCCGCCAG 


AGCTTCCACT 


23400 


TGGTCAGGCA 


GTAGTTTGAA 


GTTCGCCTTT 


AGATCGTTAT 


CCACGTGGTA 


CTTGTCCATC 


23460 


AGCGCGCGCG 


CAGCCTCCAT 


GCCCTTCTCC 


CACGCAGACA 


CGATCGGCAC 


ACTCAGCGGG 


23520 


TTCATCACCG 


TAATTTCACT 


TTCCGCTTCG 


CTGGGCTCTT 


CCTCTTCCTC 


TTGCGTCCGC 


23580 
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23640 
23700 
23760 
23820 
23880 



ATACCACGCG CCACTGGGTC GTCTTCATTC AGCCGCCGCA CTGTGCGCTT ACCTCCTTTG 
CCATGCTTGA TTAGCACCGG TGGGTTGCTG AAACCCACCA TTTGTAGCGC CACATCTTCT 
CTTTCTTCCT CGCTGTCCAC GATTACCTCT GGTGATGGCG GGCGCTCGGG CTTGGGAGAA 
GGGCGCTTCT TTTTCTTCTT GGGCGCAATG GCCAAATCCG CCGCCGAGGT CGATGGCCGC 
GGGCTGGGTG TGCGCGGCAC CAGCGCGTCT TGTGATGAGT CTTCCTCGTC CTCGGACTCG 
ATACGCCGCC TCATCCGCTT TTTTGGGGGC GCCCGGGGAG GCGGCGGCGA CGGGGACGGG 23940 
GACGACACGT CCTCCATGGT TGGGGGACGT CGCGCCGCAC CGCGTCCGCG CTCGGGGGTG 24000 
GTTTCGCGCT GCTCCTCTTC CCGACTGGCC ATTTCCTTCT CCTATAGGCA GAAAAAGATC 
ATGGAGTCAG TCGAGAAGAA GGACAGCCTA ACCGCCCCCT CTGAGTTCGC CACCACCGCC 
TCCACCGATG CCGCCAACGC GCCTACCACC TTCCCCGTCG AGGCACCCCC GCTTGAGGAG 
GAGGAAGTGA TTATCGAGCA GGACCCAGGT TTTGTAAGCG AAGACGACGA GGACCGCTCA 
GTACCAACAG AGGATAAAAA GCAAGACCAG GACAACGCAG AGGCAAACGA GGAACAAGTC 24300 
GGGCGGGGGG ACGAAAGGCA TGGCGACTAC CTAGATGTGG GAGACGACGT GCTGTTGAAG 24360 
CATCTGCAGC GCCAGTGCGC CATTATCTGC GACGCGTTGC AAGAGCGCAG CGATGTGCCC 24420 



24060 
24120 
24180 
24240 



CTCGCCATAG 


CGGATGTCAG 


CCTTGCCTAC 




TATTCTCACC 


GCGCGTACCC 


24480 


CCCAAACGCC 


AAGAAAACGG 


CACATGCGAG 




GCCTCAACTT 


CTACCCCGTA 


24540 


TTTGCCGTGC 


CAGAGGTGCT 


TGCCACCTAT 


C A C AT C TT TT 


TCCAAAACTG 


CAAGATACCC 


24600 


CTATCCTGCC 


GTGCCAACCG 


CAGCCGAGCG 


GACAAGCAGC 


TGGCCTTGCG 


GCAGGGCGCT 


24660 


GTCATACCTG 


ATATCGCCTC 


GCTCAACGAA 


GTGCCAAAAA 


TCTTTGAGGG 


TCTTGGACGC 


2472C 


GACGAGAAGC 


GCGCGGCAAA 


CGCTCTGCAA 


CAGGAAAACA 


GCGAAAATGA 


AAGTCACTCT 


24780 


GGAGTGTTGG 


TGGAACTCGA 


GGGTGACAAC 


GCGCGCCTAG 


CCGTACTAAA 


ACGCAGCATC 


24840 


GAGGTCACCC 


ACTTTGCCTA 


CCCGGCACTT 


AACCTACCCC 


CCAAGGTCAT 


GAGCACAGTC 


24900 


ATGAGTGAGC 


TGATCGTGCG 


CCGTGCGCAG 


CCCCTGGAGA 


GGGATGCAAA 


TTTGCAAGAA 


24960 


CAAACAGAGG 


AGGGCCTACC 


CGCAGTTGGC 


GACGAGCAGC 


TAGCGCGCTG 


GCTTCAAACG 


25020 


CGCGAGCCTG 


CCGACTTGGA 


GGAGCGACGC 


AAACTAATGA 


TGGCCGCAGT 


GCTCGTTACC 


25080 


GTGGAGCTTG 


AGTGCATGCA 


GCGGTTCTTT 


GCTGACCCGG 


AGATGCAGCG 


CAAGCTAGAG 


25140 


GAAACATTGC 


ACTACACCTT 


TCGACAGGGC 


TACGTACGCC 


AGGCCTGCAA 


GATCTCCAAC 


25200 


GTGGAGCTCT 


GCAACCTGGT 


CTCCTACCTT 


GGAATTTTGC 


ACGAAAACCG 


CCTTGGGCAA 


25260 


AACGTGCTTC 


ATTCCACGCT 


CAAGGGCGAG 


GCGCGCCGCG 


ACTACGTCCG 


CGACTGCGTT 


25320 


TACTTATTTC 


TATGCTACAC 


CTGGCAGACG 


GCCATGGGCG 


TTTGGCAGCA GTGCTTGGAG 


25380 


GAGTGCAACC 


TCAAGGAGCT 


GCAGAAACTG 


CTAAAGCAAA 


ACTTGAAGGA 


CCTATGGACG 


25440 


GCCTTCAACG 


AGCGCTCCGT 


GGCCGCGCAC 


CTGGCGGACA 


TCATTTTCCC 


CGAACGCCTG 


25500 


CTTAAAACCC 


TGCAACAGGG 


TCTGCCAGAC 


TTCACCAGTC 


AAAGCATGTT 


GCAGAACTTT 


25560 


AGGAACTTTA 


TCCTAGAGCG 


CTCAGGAATC 


TTGCCCGCCA 


CCTGCTGTGC 


ACTTCCTAGC 


25620 


GACTTTGTGC 


CCATTAAGTA 


CCGCGAATGC 


CCTCCGCCGC 


TTTGGGGCCA 


CTGCTACCTT 


25680 
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CTGCAGCTAG 


CCAACTACCT 


TGCCTACCAC 


TCTGACATAA 


TGGAAGACGT 


GAGCGGTGAC 


25740 


GGTCTACTGG 


AGTGTCACTG 


TCGCTGCAAC 


CTATGCACCC 


CGCACCGCTC 


CCTGGTTTGC 


25800 


AATTCGCAGC 


TGCTTAACGA 


AAGTCAAATT 


ATCGGTACCT 


TTGAGCTGCA 


GGGTCCCTCG 


25860 


CCTGACGAAA 


AGTCCGCGGC 


TCCGGGGTTG 


AAACTCACTC 


CGGGGCTGTG 


GACGTCGGCT 


25920 


TACCTTCGCA 


AATTTGTACC 


TGAGGACTAC 


CACGCCCACG 


AGATTAGGTT 


CTACGAAGAC 


25980 


CAATCCCGCC 


CGCCAAATGC 


GGAGCTTACC 


GCCTGCGTCA 


TTACCCAGGG 


CCACATTCTT 


26040 


GGCCAATTGC 


AAGCCATCAA 


CAAAGCCCGC 


CAAGAGTTTC 


TGCTACGAAA 


GGGACGGGGG 


26100 


GTTTACTTGG 


ACCCCCAGTC 


CGGCGAGGAG 


CTCAACCCAA 


TCCCCCCGCC 


GCCGCAGCCC 


26160 


TATCAGCAGC 


AGCCGCGGGC 


CCTTGCTTCC 


CAGGATGGCA 


CCCAAAAAGA 


AGCTGCAGCT 


26220 


GCCGCCGCCA 


CCCACGGACG 


AGGAGGAATA 


CTGGGACAGT 


CAGGCAGAGG 


AGGTTTTGGA 


26280 


CGAGGAGGAG 


GAGGACATGA 


TGGAAGACTG 


GGAGAGCCTA 


GACGAGGAAG 


CTTCCGAGGT 


26340 


CGAAGAGGTG 


TCAGACGAAA 


CACCGTCACC 


CTCGGTCGCA 


TTCCCCTCGC 


CGGCGCCCCA 


26400 


GAAATCGGCA 


ACCGGTTCCA 


GCATGGCTAC 


AACCTCCGCT 


CCTCAGGCGC 


CGCCGGCACT 


26460 


GCCCGTTCGC 


CGACCCAACC 


GTAGATGGGA 


CACCACTGGA 


ACCAGGGCCG 


GTAAGTCCAA 


26520 


GCAGCCGCCG 


CCGTTAGCCC 


AAGAGCAACA 


ACAGCGCCAA 


GGCTACCGCT 


CATGGCGCGG 


26580 


GCACAAGAAC 


GCCATAGTTG 


CTTGCTTGCA 


AGACTGTGGG 


GGCAACATCT 


CCTTCGCCCG 


26640 


CCGCTTTCTT 


CTCTACCATC 


ACGGCGTGGC 


CTTCCCCCGT 


AACATCCTGC 


ATT ACT AC CG 


26700 


TCATCTCTAC 


AGCCCATACT 


GCACCGGCGG 


CAGCGGCAGC 


GGCAGCAACA 


GCAGCGGCCA 


26760 


CACAGAAGCA 


AAGGCGACCG 


GATAGCAAGA 


CTCTGACAAA 


GCCCAAGAAA 


TCCACAGCGG 


26820 


CGGCAGCAGC 


AGGAGGAGGA 


GCGCTGCGTC 


TGGCGCCCAA 


CGAACCCGTA 


TCGACCCGCG 


26880 


AGCTTAGAAA 


CAGGATTTTT 


CCCACTCTGT 


ATGCTATATT 


TCAACAGAGC 


AGGGGCCAAG 


26940 


AACAAGAGCT 


GAAAATAAAA 


AACAGGTCTC 


TGCGATCCCT 


CACCCGCAGC 


TGCCTGTATC 


27000 


ACAAAAGCGA 


AGATCAGCTT 


CGGCGCACGC 


TGGAAGACGC 


GGAGGCTCTC 


TTCAGTAAAT 


27060 


ACTGCGCGCT 


GACTCTTAAG 


GACTAGTTTC 


GCGCCCTTTC 


TCAAATTTAA 


GCGCGAAAAC 


27120 


TACGTCATCT 


CCAGCGGCCA 


CACCCGGCGC 


CAGCACCTGT 


CGTCAGCGCC 


ATTATGAGCA 


27180 


AGGAAATTCC 


CACGCCCTAC 


ATGTGGAGTT 


ACCAGCCACA 


AATGGGACTT 


GCGGCTGGAG 


27240 


CTGCCCAAGA 


CTACTCAACC 


CGAATAAACT 


ACATGAGCGC 


GGGACCCCAC 


ATGATATCCC 


27300 


GGGTCAACGG 


AATCCGCGCC 


CACCGAAACC 


GAATTCTCTT 


GGAACAGGCG 


GCTATTACCA 


27360 


CCACACCTCG 


TAATAACCTT 


AATCCCCGTA 


GTTGGCCCGC 


TGCCCTGGTG 


TACCAGGAAA 


27420 


GTCCCGCTCC 


CACCACTGTG 


GTACTTCCCA 


GAGACGCCCA 


GGCCGAAGTT 


CAGATGACTA 


27480 


ACTCAGGGGC 


GCAGCTTGCG 


GGCGGCTTTC 


GTCACAGGGT 


GCGGTCGCCC 


GGGCAGGGTA 


27540 


TAACTCACCT 


GACAATCAGA 


GGGCGAGGTA 


TTCAGGTCAA 


CGACGAGTCG 


GTGAGCTCCT 


27600 


CGCTTGGTCT 


CCGTCCGGAC 


GGGACATTTC 


AGATCGGCGG 


CGCCGGCCGT 


CCTTCATTCA 


27660 


CGCCTCGTCA 


GGCAATCCTA 


ACTCTGCAGA 


CCTCGTCCTC 


TGAGCCGCGC 


TCTGGAGGCA 


27720 


TTGGAACTCT 


GCAATTTATT 


GAGGAGTTTG 


TGCCATCGGT 


CTACTTTAAC 


CCCTTCTCGG 


27780 
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GACCTCCCGG CCACTATCCG GATCAATTTA TTCCTAACTT TGACGCGGTA AAGGACTCGG 



CGGACGGCTA CGACTGAATG TTAAGTGGAG AGGCAGAGCA ACTGCGCCTG AAACACCTGG 
TCCACTGTCG CCGCCACAAG TGCTTTGCCC GCGACTCCGG TGAGTTTTGC TACTTTGAAT 
7GCCCGAGGA TCATATCGAG GGCCCGGCGC ACGGCGTCCG GCTTACCGCC CAGGGAGAGC 
TTGCCCG7AG CCTGATTCGG GAGTTTACCC AGCGCCCCCT GCTAGTTGAG CGGGACAGGG 
GACCCTG7GT TCTCACTGTG ATTTGCAACT GTCCTAACCT TGGATTACAT CAAGATCTT? 
GTTGCCATCT CTGTGCTGAG TATAATAAAT ACAGAAATTA AAATATACTG GGGCTCCTAT 
CGCCATCCTG TAAACGCCAC CGTCTTCACC CGCCCAAGCA AACCAAGGCG AACCTTACC" 
GGTACTTTTA ACATCTCTCC CTCTGTGATT TACAACAGTT TCAACCCAGA CGGAGTGAGT 
CTACGAGAGA ACCTCTCCGA GCTCAGCTAC TCCATCAGAA AAAACACCAC CCTCCTTACC 
TGCCGGGAAC GTACGAGTGC GTCACCGGCC GCTGCACCAC ACCTACCGCC TGACCGTAAA 
CCAGACTTTT TCCGGACAGA CCTCAATAAC TCTGTTTACC AGAACAGGAG GTGAGCTTAG 
AAAACCC7TA GGGTATTAGG CCAAAGGCGC AGCTACTGTG GGGTTTATGA ACAATTCAAG 
CAACTCTACG GGCTATTCTA ATTCAGGTTT CTCTAGAATC GGGGTTGGGG TTATTCTCTG 
TCTTGTGATT CTCTTTATTC TTATACTAAC GCTTCTCTGC CTAAGGCTCG CCGCCTGCTG 
TGTGCACA7T TGCATTTATT GTCAGCTTTT TAAACGCTGG GGTCGCCACC CAAGATGAT" r 
AGG7ACA7AA TCCTAGGTTT ACTCACCCTT GCGTCAGCCC ACGGTACCAC CCAAAAGGTG 
GATTTTAAGG AGCCAGCCTG TAATGTTACA TTCGCAGCTG AAGCTAATGA GTGCACCACT 
CTTATAAAAT GCACCACAGA ACATGAAAAG CTGCTTATTC GCCACAAAAA CAAAATTGGC 
AAGTATGC7G TTTATGCTAT TTGGCAGCCA GGTGACACTA CAGAGTATAA TGTTACAGTT 
TTCCAGGGTA AAAGTCATAA AACTTTTATG TATACTTTTC CATTTTATGA AATGTGCGAC 
ATTACCATGT ACATGAGCAA ACAGTATAAG TTGTGGCCCC CACAAAATTG TGTGGAAAAC 
ACTGGCACTT TCTGCTGCAC TGCTATGCTA ATTACAGTGC TCGCTTTGGT CTGTACCCTA 
CTC7ATA77A AATACAAAAG CAGACGCAGC TTTATTGAGG AAAAGAAAAT GCCTTAATTT 
ACTAAG7TAC AAAGCTAATG TCACCACTAA CTGCTTTACT CGCTGCTTGC AAAACAAATT 
CAAAAAG7TA GCATTATAAT TAGAATAGGA TTTAAACCCC CCGGTCATTT CCTGCTCAAT 
ACCATTCCCC TGAACAATTG ACTCTATGTG GGATATGCTC CAGCGCTACA ACCTTGAAGT 
CAGGCT7CCT GGATGTCAGC ATCTGACTTT GGCCAGCACC TGTCCCGCGG ATTTGTTCCA 
GTCCAAC7AC AGCGACCCAC CCTAACAGAG ATGACCAACA CAACCAACGC GGCCGCCGCT 
ACCGGAC7TA CATCTACCAC AAATACACCC CAAG7TTCTG CCTTTGTCAA TAACTGGGAT 
AACTTGGGCA TGTGGTGG7T CTCCA7AGCG CTTATGT7TG TATGCCTTAT TATTATGTGG 
CTCATC7GCT GCC7AAAGCG CAAACGCGCC CGACCACCCA TC7ATAGTCC CATCATTGTG 
CTACACCCAA ACAATGATGG AATCCATAGA TTGGACGGAC TGAAACACAT GTTCTTTTCT 29760 
CTTACAG7AT GATTAAATGA GACATGATTC CTCGAGT7TT TATATTACTG ACCCTTGTTG 29820 
CGCTT777TG TGCGTGCTCC ACATTGGCTG CGGTTTCTCA CATCGAAGTA GACTGCATTC 29880 



27840 
27900 
27960 
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28260 
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CAGCCTTCAC 
TCACTGTGGT 
TCAGACACCA 
TATGAAATTT 
GACCTCCAAG 
TTGCTACAAT 
GGTGTTCTGC 
ACGAATAGAT 
AGTTGTTGCC 
TGAAATCAGC 
ACGGAATTAT 
GCATGAATCA 
GTCTGGTAAA 
ACAAGTTGCC 
TAACTCAGCA 
ATCTCTGCAC 
AAAAAAAAAT 
TTCAGCAGCA 
AACTTTCTCC 
ACTATCTTCA 
GTGTATCCAT 
GTATCCCCCA 
CCTCTAGTTA 
GAGGCCGGCA 
AAGTCAAACA 
GTGGCTGCCG 
CCGCTAACCG 
GAAGGAAAGC 
ACTATCACTG 
GAGCCCATTT 
ACAGACGACC 
TCCTTGCAAA 
AATGTAGCAG 
TATCCGTTTG 
AACTCAGCCC 



AGTCTATTTG 
CATCGCCTTT 
TCCCCAGTAC 
ACTGTGACTT 
CCTCAAAGAC 
GAAAAAAGCG 
AGTACCATCT 
GCCATGAACC 
GGCGGCTTTG 
TACTTTAATC 
TACAGAGCAG 
AGAGCTCCAA 
GCAGGCCAAA 
AACCAAGCGT 
CTCGGTAGAA 
CCTTATTAAG 
AATAAAGCAT 
CCTCCTTGCC 
ACAATCTAAA 
TGTTGTTGCA 
ATGACACGGA 
ATGGGTTTCA 
CCTCCAATGG 
ACCTTACCTC 
TAAACCTGGA 
CCGCACCTCT 
TGCACGACTC 
TAGCCCTGCA 
CCTCACCCCC 
ATACACAAAA 
TAAACACTTT 
CTAAAGTTAC 
GAGGACTAAG 
ATGCTCAAAA 
ACAACTTGGA 



CTTTACGGAT 
ATCCAGTGCA 
AGGGACAGGA 
TTCTGCTGAT 
ATATATCATG 
ATCTTTCCGA 
TAGCCCTAGC 
ACCCAACTTT 
TCCCAGCCAA 
TAACAGGAGG 
CGCCTGCTAG 
GACATGGTTA 
GTCACCTACG 
CAGAAATTGG 
ACCGAAGGCT 
ACCCTGTGCG 
CACTTACTTA 
CTCCTCCCAG 
TGGAATGTCA 
GATGAAGCGC 
AACCGGTCCT 
AGAGAGTCCC 
CATGCTTGCG 
CCAAAATGTA 
AATATCTGCA 
AATGGTCGCG 
CAAACTTAGC 
AACATCAGGC 
TCTAACTACT 
TGGAAAACTA 
GACCGTAGCA 
TGGAGCCTTG 
GATTGATTCT 
CCAACTAAAT 
TATTAACTAC 



TTGTCACCCT 
T7GACTGGGT 
CTATAGCTGA 
TATTTGCACC 
CAGATTCACT 
AGCCTGGTTA 
TATATATCCC 
CCCCGCGCCC 
TCAGCCTCGC 
AGATGACTGA 
AAAGACGCAG 
ACTTGCACCA 
ACAGTAATAC 
TGGTCATGGT 
GCATTCACTC 
GTCTCAAAGA 
AAATCAGTTA 
CTCTGGTATT 
GTTTCCTCCT 
GCAAGACCGT 
CCAACTGTGC 
CCTGGGGTAC 
CTCAAAATGG 
ACCACTGTGA 
CCCCTCACAG 
GGCAACACAC 
ATTGCCACCC 
CCCCTCACCA 
GCCACTGGTA 
GGACTAAAGT 
ACTGGTCCAG 
GGTTTTGATT 
CAAAACAGAC 
CTAAGACTAG 
AACAAAGGCC 



CACGCTCATC 
CTGTGTGCGC 
GCTTCTTAGA 
CTATCTGCGT 
CGTATATGGA 
TATGCAATCA 
TACCTTGACA 
GCTATGCTTC 
CCCACTTCTC 
CACCCTAGAT 
GGCAGCGGCC 
GTGCAAAAGG 
CACCGGACAC 
GGGAGAAAAG 
ACCTTGTCAA 
TCTTATTCCC 
GCAAATTTCT 
GCAGCTTCCT 
GTTCCTGTCC 
CTGAAGATAC 
CTTTTCTTAC 
TCTCTTTGCG 
GCAACGGCCT 
GCCCACCTCT 
TTACCTCAGA 
TCACCATGCA 
AAGGACCCCT 
CCACCGATAG 
GCTTGGGCAT 
ACGGGGCTCC 
GTGTGACTAT 
CACAAGGCAA 
GCCTTATACT 
GACAGGGCCC 
TTTACTTGTT 
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TGCAGCCTCA 


29940 


TTTGCATATC 


30000 


ATTCTTTAAT 


30060 


TTTGTTCCCC 


30120 


ATATTCCAAG 


30180 


TCTCTGTTAT 


30240 


TTGGCTGGAA 


30300 


CACTGCAACA 


30360 


CCACCCCCAC 


30420 


CTAGAAATGG 


30480 


GAGCAACAGC 


30540 


GGTATCTTTT 


30600 


CGCCTTAGCT 


30660 


CCCATTACCA 


30720 


GGACCTGAGG 


30780 


TTTAACTAAT 


30840 


GTCCAGTTTA 


30900 


CCTGGCTGCA 


30960 


ATCCGCACCC 


31020 


CTTCAACCCC 


31080 


TCCTCCCTTT 


31140 


CCTATCCGAA 


31200 


CTCTCTGGAC 


31260 


CAAAAAAACC 


31320 


AGCCCTAACT 


31380 


ATCACAGGCC 


31440 


CACAGTGTCA 


31500 


CAGTACCCTT 


31560 


TGACTTGAAA 


31620 


TTTGCATGTA 


31680 


TAATAATACT 


31740 


TATGCAACTT 


31800 


TGATGTTAGT 


31860 


TCTTTTTATA 


31920 


TACAGCTTCA 


31980 
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AACAA7TCCA AAAAGCTTGA GGTTAACCTA AGCACTGCCA AGGGGTTGAT GTTTGACGCT 32040 
ACAGCCATAG CCATTAATGC AGGAGATGGG CTTGAATTTG GTTCACCTAA TGCACCAAAC 32100 
ACAAATCCCC TCAAAACAAA AATTGGCCAT GGCCTAGAAT TTGATTCAAA CAAGGCTATG 32160 
GTTCCTAAAC TAGGAACTGG CCTTAGTTTT GACAGCACAG GTGCCATTAC AGTAGGAAAC 
AAAAATAATG ATAAGCTAAC TTTGTGGACC ACACCAGCTC CATCTCCTAA CTGTAGACTA 
AATGCAGAGA AAGATGCTAA ACTCACTTTG GTCTTAACAA AATGTGGCAG TCAAATACTT 
GCTACAGTTT CAGTTTTGGC TGTTAAAGGC AGTTTGGCTC CAATATCTGG AACAGTTCAA 
AGTGCTCATC TTATTATAAG ATTTGACGAA AATGGAGTGC TACTAAACAA TTCCTTCCTG 
GACCCAGAAT ATTGGAACTT TAGAAATGGA GATCTTACTG AAGGCACAGC CTATACAAAC 32520 
GCTGTTGGAT TTATGCCTAA CCTATCAGCT TATCCAAAAT CTCACGGTAA AACTGCCAAA 32580 
AGTAACATTG TCAGTCAAGT TTACTTAAAC GGAGACAAAA CTAAACCTGT AACACTAACC 
ATTACACTAA ACGGTACACA GGAAACAGGA GACACAACTC CAAGTGCATA CTCTATGTCA 
TTTTCATGGG ACTGGTCTGG CCACAACTAC ATTAATGAAA TATTTGCCAC ATCCTCTTAC 
ACTTTTTCAT ACATTGCCCA AGAATAAAGA ATCGTTTGTG TTATGTTTCA ACGTGTTTAT 32820 
TTTTCAATTG CAGAAAATTT CAAGTCATTT TTCATTCAGT AGTATAGCCC CACCACCACA 32880 
TAGCTTATAC AGATCACCGT ACCTTAATCA AACTCACAGA ACCCTAGTAT TCAACCTGCC 32940 
ACCTCCCTCC CAACACACAG AGTACACAGT CCTTTCTCCC CGGCTGGCCT TAAAAAGCAT 33000 
CATATCATGG GTAACAGACA TATTCTTAGG TGTTATATTC CACACGGTTT CCTGTCGAGC 33060 
CAAACGCTCA TCAGTGATAT TAATAAACTC CCCGGGCAGC TCACTTAAGT TCATGTCGCT 33120 
GTCCAGCTGC TGAGCCACAG GCTGCTGTCC AACTTGCGGT TGCTTAACGG GCGGCGAAGG 
AGAAGTCCAC GCCTACATGG GGGTAGAGTC ATAATCGTGC ATCAGGATAG GGCGGTGGTG 
CTGCAGCAGC GCGCGAATAA ACTGCTGCCG CCGCCGCTCC GTCCTGCAGG AATACAACAT 
GGCAGTGGTC TCCTCAGCGA TGATTCGCAC CGCCCGCAGC ATAAGGCGCC TTGTCCTCCG 33360 
GGCACAGCAG CGCACCCTGA TCTCACTTAA ATCAGCACAG TAACTGCAGC ACAGCACCAC 33420 
AATATTGTTC AAAATCCCAC AGTGCAAGGC GCTGTATCCA AAGCTCATGG CGGGGACCAC 33480 
AGAACCCACG TGGCCATCAT ACCACAAGCG CAGGTAGATT AAGTGGCGAC CCCTCATAAA 33540 
CACGCTGGAC ATAAACATTA CCTCTTTTGG CATGTTGTAA TTCACCACCT CCCGGTACCA 33600 
TATAAACCTC TGATTAAACA TGGCGCCATC CACCACCATC CTAAACCAGC TGGCCAAAAC 33660 
CTGCCCGCCG GCTATACACT GCAGGGAACC GGGACTGGAA CAATGACAGT GGAGAGCCCA 33720 
GGACTCGTAA CCATGGATCA TCATGCTCGT CATGATATCA ATGTTGGCAC AACACAGGCA 33780 
CACGTGCATA CACTTCCTCA GGATTACAAG CTCCTCCCGC GTTAGAACCA TATCCCAGGG 33840 
AACAACCCAT TCCTGAATCA GCGTAAATCC CACACTGCAG GGAAGACCTC GCACGTAACT 33900 
CACGTTGTGC ATTGTCAAAG TGTTACATTC GGGCAGCAGC GGATGATCCT CCAGTATGGT 33960 
AGCGCGGGTT TCTGTCTCAA AAGGAGGTAG ACGATCCCTA CTGTACGGAG TGCGCCGAGA 34020 
CAACCGAGAT CGTGTTGGTC GTAGTGTCAT GCCAAATGGA ACGCCGGACG TAGTCATATT 



33180 
33240 
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TCCTGAAGCA 


AAACCAGGTG 


CGGGCGTGAC 


AAACAGATCT 


GCGTCTCCGG 


TCTCGCCGC7 


34140 


TAGATCGCTC 


TGTGTAGTAG 


TTGTAGTATA 


TCCACTCTCT 


CAAAGCATCC 


AGGCGCCCCC 


34200 


TGGC7TCGGG 


TTCTATGTAA 


ACTCCTTCAT 


GCGCCGCTGC 


CCTGATAACA 


TCCACCACCG 


34260 


CAGAATAAGC 


CACACCCAGC 


CAACCTACAC 


ATTCGTTCTG 


CGAGTCACAC 


ACGGGAGGAG 


34320 


CGGGAAGAGC 


TGGAAGAACC 


ATGTTTTTTT 


TTTTATTCCA 


AAAGATTATC 


CAAAACCTCA 


34380 


AAATGAAGAT 


CTATTAAGTG 


AACGCGCTCC 


CCTCCGGTGG 


CGTGGTCAAA 


CTCTACAGCC 


34440 


AAAGAACAGA 


TAATGGCATT 


TGTAAGATGT 


TGCACAATGG 


CTTCCAAAAG 


GCAAACGGCC 


34500 


CTCACGTCCA 


AGTGGACGTA 


AAGGCTAAAC 


CCTTCAGGGT 


GAATCTCCTC 


TATAAACATT 


34560 


CCAGCACCTT 


CAACCATGCC 


CAAATAATTC 


TCATCTCGCC 


ACCTTCTCAA 


TATATCTCTA 


34620 


AGCAAATCCC 


GAATATTAAG 


TCCGGCCATT 


GTAAAAATCT 


GCTCCAGAGC 


GCCCTCCACC 


34680 


TTCAGCCTCA 


AGCAGCGAAT 


CATGATTGCA 


AAAATTCAGG 


TTCCTCACAG 


ACCTGTATAA 


34740 


GATTCAAAAG 


CGGAACATTA 


ACAAAAATAC 


CGCGATCCCG 


TAGGTCCCTT 


CGCAGGGCCA 


34800 


GCTGAACATA 


ATCGTGCAGG 


TCTGCACGGA 


CCAGCGCGGC 


CACTTCCCCG 


CCAGGAACCT 


34860 


TGACAAAAGA 


ACCCACACTG 


ATTATGACAC 


GCATACTCGG 


AGCTATGCTA 


ACCAGCGTAG 


34920 


CCCCGATGTA 


AGCTTTGTTG 


CATGGGCGGC 


GATATAAAAT 


GCAAGGTGCT 


GCTCAAAAAA 


34980 


TCAGGCAAAG 


CCTCGCGCAA 


AAAAGAAAGC 


ACATCGTAGT 


CATGCTCATG 


CAGATAAAGG 


35040 


CAGGTAAGCT 


CCGGAACCAC 


CACAGAAAAA 


GACACCATTT 


TTCTCTCAAA 


CATGTCTGCG 


35100 


GGTTTCTGCA 


TAAACACAAA 


ATAAAATAAC 


AAAAAAACAT 


TTAAACATTA 


GAAGCCTGTC 


35160 


TTACAACAGG 


AAAAACAACC 


CTTATAAGCA 


TAAGACGGAC 


TACGGCCATG 


CCGGCGTGAC 


35220 


CGTAAAAAAA 


CTGGTCACCG 


TGATTAAAAA 


GCACCACCGA 


CAGCTCCTCG 


GTCATGTCCG 


35280 


GAGTCATAAT 


GTAAGACTCG 


GTAAACACAT 


CAGGTTGATT 


CATCGGTCAG 


TGCTAAAAAG 


35340 


CGACCGAAAT 


AGCCCGGGGG 


AATACATACC 


CGCAGGCGTA 


GAGACAACAT 


TACAGCCCCC 


35400 


ATAGGAGGTA 


TAACAAAATT 


AATAGGAGAG 


AAAAACACAT 


AAACACCTGA 


AAAACCCTCC 


35460 


TGCCTAGGCA 


AAATAGCACC 


CTCCCGCTCC 


AGAACAACAT 


ACAGCGCTTC 


ACAGCGGCAG 


35520 


CCTAACAGTC 


AGCCTTACCA 


GTAAAAAAGA 


AAACCTATTA 


AAAAAACACC 


ACTCGACACG 


35580 


GCACCAGCTC 


AATCAGTCAC 


AGTGTAAAAA 


AGGGCCAAGT 


GCAGAGCGAG 


TATATATAGG 


35640 


ACTAAAAAAT 


GACGTAACGG 


TTAAAGTCCA 


CAAAAAACAC 


CCAGAAAACC 


GCACGCGAAC 


35700 


CTACGCCCAG 


AAACGAAAGC 


CAAAAAACCC 


ACAACTTCCT 


CAAATCGTCA 


CTTCCGTTTT 


35760 


CCCACGTTAC 


GTAACTTCCC 


ATTTTAAGAA 


AACTACAATT 


CCCAACACAT 


ACAAGTTACT 


35820 


CCGCCCTAAA 


ACCTACGTCA 


CCCGCCCCGT 


TCCCACGCCC 


CGCGCCACGT 


CACAAACTCC 


35880 


ACCCCCTCAT 


TATCATATTG 


GCTTCAATCC 


AAAATAAGGT 


ATATTATTGA 


TGATG 


35935 


(2) INFORMATION FOR SEQ ID NO : 2 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION : /desc * "DNA" 

Ui) SEQUENCE DESCRIPTION : SEQ ID NO: 2: 

TTCATTTTAT GTTTCAGGTT CAGGG 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

M ?^™ PE: ° ther nucleic *cid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

TTACCGCCAC ACTCGCAGGG 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH : 34303 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

TTATTTTGGA TTGAAGCCAA TATGATAATG AGGGGGTGGA 

CGTGGGAACG GGGCGGGTGA CGTAGTAGTG TGGCGGAAGT 

GAACACATGT AAGCGACGGA TGTGGCAAAA GTGACGTTTT 

GGACGGGTGT GGTCGCCATG ATCGCGTAGT CGATAGTGGC 

GGACTGGGCG GCGGCCAAAG CGGTCGGACA GTGCTCCGAG 

GCATCAACGC ATATAGCGCT AGCAGCACGC CATAGTGACT 

CGATATCCCG CAAGAGGCCC GGCAGTACCG GCATAACCAA 

GGGTGACGGT GCCGAGGATG ACGATGAGCG CATTGTTAGA 

TGCGTTAGCA ATTTAACTGT GATAAACTAC CGCATTAAAG 

AAACATGAGA ATTCTTGAAG ACGAAAGGGC CTCGTGATAC 

GTCATGATAA TAATGGTTTC TTAGACGTCA GGTGGCACTT 

ACCCCTATTT GTTTATTTTT CTAAATACAT TCAAATATGT 

CCCTGATAAA TGCTTCAATA ATATTGAAAA AGGAAGAGTA 

GTCGCCCTTA TTCCCTTTTT TGCGGCATTT TGCCTTCCTG 

CTGGTGAAAG TAAAAGATGC TGAAGATCAG TTGGGTGCAC 

GATCTCAACA GCGGTAAGAT CCTTGAGAGT TTTCGCCCCG 

AGCACTTTTA AAGTTCTGCT ATGTGGCGCG GTATTATCCC 

CAACTCGGTC GCCGCATACA CTATTCTCAG AATGACTTGG 
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GTTTGTGACG 
GTGATGTTGC 
TGGTGTGCGC 
TCCAAGTAGC 
AACGGGTGCG 
GGCGATGCTG 
GCCTATGCCT 
TTTCATACAC 
CTTATCGATG 
GCCTATTTTT 
TTCGGGGAAA 
ATCCGCTCAT 
TGAGTATTCA 
TTTTTGCTCA 
GAGTGGGTTA 
AAGAACGTTT 
GTGTTGACGC 
TTGAGTACTC 



TGGCGCGGGG 
AAGTGTGGCG 
CGGATCCACA 
GAAGCGAGCA 
CATAGAAATT 
TCGGAATGGA 
ACAGCATCCA 
GGTGCCTGAC 
ATAAGCTGTC 
ATAGGTTAAT 
TGTGCGCGGA 
GAGACAATAA 
ACATTTCCGT 
CCCAGAAACG 
CATCGAACTG 
TCCAATGATG 
CGGGCAAGAG 
ACCAGTCACA 



20 



60 
120 
180 
240 
300 
360 
420 
48C 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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GAAAAGC.VTC 


7TACGGATGG 


CATGACAGTA 


AGAGAATTAT 


GCAGTGCTGC 


CATAACCATG 


1140 


AGTGA7AACA 


CTGCGGCCAA 


CTTACTTCTG 


ACAACGATCG 


GAGGACCGAA 


GGAGCTAACC 


1200 


GCTTTT77GC 


ACAACATGGG 


GGATCATGTA 


ACTCGCCTTG 


ATCG7TGGGA 


ACCGGAGCTG 


1260 


AATGAAGCCA 


TACCAAACGA 


CGAGCGTGAC 


ACCACGATGC 


CTGCAGCAAT 


GGCAACAACG 


1320 


TTGCGCAAAC 


TATTAACTGG 


CGAACTACTT 


ACTCTAGCTT 


CCCGGCAACA 


A7TAATAGAC 


1380 


TGGATGGAGG 


CGGATAAAGT 


TGCAGGACCA 


CTTCTGCGCT 


CGGCCCTTCC 


GGCTGGCTGG 


1440 


TTTATTGCTG 


ATAAATCTGG 


AGCCGGTGAG 


CG7GGG7CTC 


GCGGTATCAT 


TGCAGCACTG 


1500 


GGGCCAGATG 


G7AAGCCCTC 


CCGTATCGTA 


GTTATCTACA 


CGACGGGGAG 


TCAGGCAACT 


1560 


ATGGATGAAC 


GAAATAGACA 


GATCGCTGAG 


ATAGGTGCCT 


CACTGATTAA 


GCATTGGTAA 


1620 


CTGTCAGACC 


AAGTTTACTC 


ATATATACTT 


TAGA7TGATT 


TAAAACTTCA 


TTTTTAATTT 


1680 


AAAAGGATCT 


AGGTGAAGAT 


CCTTTTTGAT 


AATCTCATGA 


CCAAAATCCC 


TTAACGTGAG 


1740 


TTTTCGTTCC 


ACTGAGCGTC 


AGACCCCGTA 


GAAAAGATCA 


AAGGATCTTC 


TTGAGATCCT 


1800 


TTTTTTCTGC 


GCGTAATCTG 


CTGCTTGCAA 


ACAAAAAAAC 


CACCGCTACC 


AGCGGTGGTT 


1860 


TGTTTGCCGG 


ATCAAGAGCT 


ACCAACTCTT 


TTTCCGAAGG 


TAACTGGCTT 


CAGCAGAGCG 


1920 


CAGATACCAA 


ATAC7GTCCT 


TCTAGTGTAG 


CCGTAGTTAG 


GCCACCACTT 


CAAGAACTCT 


1980 


GTAGCACCGC 


CTACATACCT 


CGCTCTGCTA 


ATCCTGTTAC 


CAGTGGCTGC 


TGCCAGTGGC 


2040 


GATAAGTCGT 


GTCTTACCGG 


GTTGGACTCA 


AGACGATAGT 


TACCGGATAA 


GGCGCAGCGG 


2100 


TCGGGCTGAA 


CGGGGGGTTC 


GTGCACACAG 


CCCAGCTTGG 


AGCGAACGAC 


CTACACCGAA 


2160 


CTGAGATACC 


TACAGCGTGA 


GCATTGAGAA 


AGCGCCACGC 


TTCCCGAAGG 


GAGAAAGGCG 


2220 


GACAGGTATC 


CGGTAAGCGG 


CAGGGTCGGA 


ACAGGAGAGC 


GCACGAGGGA 


GCTTCCAGGG 


2280 


GGAAACGCCT 


GGTATCTTTA 


TAGTCCTGTC 


GGGTTTCGCC 


ACCTCTGACT 


TGAGCGTCGA 


2340 


TTTTTGTGAT 


GCTCGTCAGG 


GGGGCGGAGC 


CTATGGAAAA 


ACGCCAGCAA 


CGCGGCCTTT 


2400 


TTACGG77CC 


TGGCCTTTTG 


CTGGCCTTTT 


GCTCACATGT 


TCTTTCCTGC 


GTTATCCCCT 


2460 


GATTCTG7GG 


ATAACCGTAT 


TACCGCCTTT 


GAGTGAGCTG 


ATACCGCTCG 


CCGCAGCCGA 


2520 


ACGACCGAGC 


GCAGCGAGTC 


AGTGAGCGAG 


GAAGCGGAAG 


AGCGCCTGAT 


GCGGTATTTT 


2580 


CTCCTTACGC 


ATCTGTGCGG 


TATTTCACAC 


CGCATATGGT 


GCACTCTCAG 


TACAATCTGC 


2640 


TCTGATGCCG 


CATAGTTAAG 


CCAGTATACA 


CTCCGCTATC 


GCTACGTGAC 


TGGGTCATGG 


2700 


CTGCGCCCCG 


ACACCCGCCA 


ACACCCGCTG 


ACGCGCCCTG 


ACGGGCTTGT 


CTGCTCCCGG 


2760 


CATCCGC7TA 


CAGACAAGCT 


GTGACCGTCT 


CCGGGAGCTG 


CATGTGTCAG 


AGGTTTTCAC 


2820 


CGTCATCACC 


GAAACGCGCG 


AGGCAGTCTA 


GACAATAGTA 


GTACGGATAG 


CTGTGACTCC 


2880 


GGTCCT7CTA 


ACACACCTCC 


TGAGATACAC 


CCGGTGGTCC 


CGCTGTGCCC 


CATTAAACCA 


2940 


GTTGCCG7GA 


GAGTTGGTGG 


GCGTCGCCAG 


GCTGTGGAAT 


GTATCGAGGA 


CTTGCTTAAC 


3000 


GAGCCTGGGC 


AACCTTTGGA 


CTTGAGCTGT 


AAACGCCCCA 


GGCCATAAGG 


TGTAAACCTG 


3060 


TGATTGCGTG 


TGTGGTTAAC 


GCCTTTGTTT 


GCTGAATGAG 


TTGATGTAAG 


TTTAATAAAG 


3120 


GGTGAGATAA 


TGTTTAACTT 


GCATGGCGTG 


TTAAATGGGG 


CGGGGCTTAA 


AGGGTATATA 


3180 
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ATGCGCCGTG GGCTAATCTT GGTTACATCT GACCTCATGG AGGCTTGGGA GTGTTTGGAA 3240 
GATTT7TCTG CTGTGCGTAA CTTGCTGGAA CAGAGCTCTA ACAGTACCTC TTGGTTTTGG 
AGGTTTCTGT GGGGCTCATC CCAGGCAAAG TTAGTCTGCA GAATTAAGGA GGATTACAAG 
TGGGAATTTG AAGAGCTTTT GAAATCCTGT GGTGAGCTGT TTGATTCTTT GAATCTGGGT 
CACCAGGCGC TTTTCCAAGA GAAGGTCATC AAGACTTTGG ATTTTTCCAC ACCGGGGCGC 
GCTGCGGCTG CTGTTGCTTT TTTGAGTTTT ATAAAGGATA AATGGAGCGA AGAAACCCAT 
CTGAGCGGGG GGTACCTGCT GGATTTTCTG GCCATGCATC TGTGGAGAGC GGTTGTGAGA 
CACAAGAATC GCCTGCTACT GTTGTCTTCC GTCCGCCCGG CGATAATACC GACGGAGGAG 
CAGCAGCAGC AGCAGGAGGA AGCCAGGCGG CGGCGGCAGG AGCAGAGCCC ATGGAACCCG 
AGAGCCGGCC TGGACCCTCG GGAATGAATG TTGTACAGGT GGCTGAACTG TATCCAGAAC 
TGAGACGCAT TTTGACAATT ACAGAGGATG GGCAGGGGCT AAAGGGGGTA AAGAGGGAGC 
GGGGGGCTTG TGAGGCTACA GAGGAGGCTA GGAATCTAGC TTTTAGCTTA ATGACCAGAC 
ACCGTCCTGA GTGTATTACT TTTCAACAGA TCAAGGATAA TTGCGCTAAT GAGCTTGATC 
TGCTGGCGCA GAAGTATTCC ATAGAGCAGC TGACCACTTA CTGGCTGCAG CCAGGGGATG 
ATTTTGAGGA GGCTATTAGG GTATATGCAA AGGTGGCACT TAGGCCAGAT TGCAAGTACA 
AGATCAGCAA ACTTGTAAAT ATCAGGAATT GTTGCTACAT TTCTGGGAAC GGGGCCGAGG 
TGGAGATAGA TACGGAGGAT AGGGTGGCCT TTAGATGTAG CATGATAAAT ATGTGGCCGG 
GGGTGCTTGG CATGGACGGG GTGGTTATTA TGAATGTAAG GTTTACTGGC CCCAATTTTA 
GCGGTACGGT TTTCCTGGCC AATACCAACC TTATCCTACA CGGTGTAAGC TTCTATGGGT 
TTAACAATAC CTGTGTGGAA GCCTGGACCG ATGTAAGGGT TCGGGGCTGT GCCTTTTACT 
GCTGCTGGAA GGGGGTGGTG TGTCGCCCCA AAAGCAGGGC TTCAATTAAG AAATGCCTCT 
TTGAAAGGTG TACCTTGGGT ATCCTGTCTG AGGGTAACTC CAGGGTGCGC CACAATGTGG 
CCTCCGACTG TGGTTGCTTC ATGCTAGTGA AAAGCGTGGC TGTGATTAAG CATAACATGG 
TATGTGGCAA CTGCGAGGAC AGGGCCTCTC AGATGCTGAC CTGCTCGGAC GGCAACTGTC 
ACCTGCTGAA GACCATTCAC GTAGCCAGCC ACTCTCGCAA GGCCTGGCCA GTGTTTGAGC 
ATAACATACT GACCCGCTGT TCCTTGCATT TGGGTAACAG GAGGGGGGTG TTCCTACCTT 
ACCAATGCAA TTTGAGTCAC ACTAAGATAT TGCTTGAGCC CGAGAGCATG TCCAAGGTGA 
ACCTGAACGG GGTGTTTGAC ATGACCATGA AGATCTGGAA GGTGCTGAGG TACGATGAGA 
CCCGCACCAG GTGCAGACCC TGCGAGTGTG GCGGTAAACA TATTAGGAAC CAGCCTGTGA 
TGCTGGATGT GACCGAGGAG CTGAGGCCCG ATCACTTGGT GCTGGCCTGC ACCCGCGCTG 
AGTTTGGCTC TAGCGATGAA GATACAGATT GAGGTACTGA AATGTGTGGG CGTGGCTTAA 
GGGTGGGAAA GAATATATAA GGTGGGGGTC TTATGTAGTT TTGTATCTGT TTTGCAGCAG 
CCGCCGCCGC CATGAGCACC AACTCGTTTG ATGGAAGCAT TGTGAGCTCA TATTTGACAA 
CGCGCATGCC CCCATGGGCC GGGGTGCGTC AGAATGTGAT GGGCTCCAGC ATTGATGGTC 
GCCCCGTCCT GCCCGCAAAC TCTACTACCT TGACCTACGA GACCGTGTCT GGAACGCCGT 
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TGGAGACTGC 


AGCCTCCGCC 


GCCGCTTCAG 


CCGCTGCAGC 


CACCGCCCGC 


GGGATTGTGA 


5340 


CTGACTTTGC 


TTTCCTGAGC 


CCGCTTGCAA 


GCAGTGCAGC 


T7CCCGTTCA 


TCCGCCCGCG 


5400 


ATGACAAGTT 


GACGGCTCTT 


TTGGCACAAT 


TGGATTCTTT 


GACCCGGGAA 


CTTAATGTCG 


5460 


TTTCTCAGCA 


GCTGTTGGAT 


CTGCGCCAGC 


AGGTTTCTGC 


CCTGAAGGCT 


TCCTCCCCTC 


5520 


CCAATGCGGT 


TTAAAACATA 


AATAAAAAAC 


CAGACTCTGT 


TTGGATTTGG 


ATCAAGCAAG 


5580 


TGTCTTGCTG 


TCTTTATTTA 


GGGGTTTTGC 


GCGCGCGGTA 


GGCCCGGGAC 


CAGCGGTCTC 


5640 


GGTCGTTGAG 


GGTCCTGTGT 


ATTTTTTCCA 


GGACGTGGTA 


AAGGTGACTC 


TGGATGTTCA 


5700 


GATACATGGG 


CATAAGCCCG 


TCTCTGGGGT 


GGAGGTAGCA 


CCACTGCAGA 


GCTTCATGCT 


5760 


GCGGGGTGGT 


GTTGTAGATG 


ATCCAGTCGT 


AGCAGGAGCG 


CTGGGCGTGG 


TGCCTAAAAA 


5820 


TGTCTTTCAG 


TAGCAAGCTG 


ATTGCCAGGG 


GCAGGCCCTT 


GGTGTAAGTG 


TTTACAAAGC 


5880 


GGTTAAGCTG 


GGATGGGTGC 


ATACGTGGGG 


ATATGAGATG 


CATCTTGGAC 


TGTATTTTTA 


5940 


GGTTGGCTAT 


GTTCCCAGCC 


ATATCCCTCC 


GGGGATTCAT 


GTTGTGCAGA 


ACCACCAGCA 


6000 


CAGTGTATCC 


GGTGCACTTG 


GGAAATTTGT 


CATGTAGCTT 


AGAAGGAAAT 


GCGTGGAAGA 


6060 


ACTTGGAGAC 


GCCCTTGTGA 


CCTCCAAGAT 


TTTCCATGCA 


TTCGTCCATA 


ATGATGGCAA 


6120 


TGGGCCCACG 


GGCGGCGGCC 


TGGGCGAAGA 


TATTTCTGGG 


ATCACTAACG 


TCATAGTTGT 


6180 


GTTCCAGGAT 


GAGATCGTCA 


TAGGCCATTT 


TTACAAAGCG 


CGGGCGGAGG 


GTGCCAGACT 


6240 


GCGGTATAAT 


GGTTCCATCC 


GGCCCAGGGG 


CGTAGTTACC 


CTCACAGATT 


TGCATTTCCC 


6300 


ACGCTTTGAG 


TTCAGATGGG 


GGGATCATGT 


CTACCTGCGG 


GGCGATGAAG 


AAAACGGTTT 


6360 


CCGGGGTAGG 


GGAGATCAGC 


TGGGAAGAAA 


GCAGGTTCCT 


GAGCAGCTGC 


GACTTACCGC 


6420 


AGCCGGTGGG 


CCCGTAAATC 


ACACCTATTA 


CCGGGTGCAA 


CTGGTAGTTA 


AGAGAGCTGC 


6480 


AGCTGCCGTC 


ATCCCTGAGC 


AGGGGGGCCA 


CTTCGTTAAG 


CATGTCCCTG 


ACTCGCATGT 


6540 


TTTCCCTGAC 


CAAATCCGCC 


AGAAGGCGCT 


CGCCGCCCAG 


CGATAGCAGT 


TCTTGCAAGG 


6600 


AAGCAAAGTT 


TTTCAACGGT 


TTGAGACCGT 


CCGCCGTAGG 


CATGCTTTTG 


AGCGTTTGAC 


6660 


CAAGCAGTTC 


CAGGCGGTCC 


CACAGCTCGG 


TCACCTGCTC 


TACGGCATCT 


CGATCCAGCA 


6720 


TATCTCCTCG 


TTTCGCGGGT 


TGGGGCGGCT 


TTCGCTGTAC 


GGCAGTAGTC 


GGTGCTCGTC 


6780 


CAGACGGGCC 


AGGGTCATGT 


CTTTCCACGG 


GCGCAGGGTC 


CTCGTCAGCG 


TAGTCTGGGT 


6840 


CACGGTGAAG 


GGGTGCGCTC 


CGGGCTGCGC 


GCTGGCCAGG 


GTGCGCTTGA 


GGCTGGTCCT 


6900 


GCTGGTGCTG 


AAGCGCTGCC 


GGTCTTCGCC 


CTGCGCGTCG 


GCCAGGTAGC 


ATTTGACCAT 


6960 


GGTGTCATAG 


TCCAGCCCCT 


CCGCGGCGTG 


GCCCTTGGCG 


CGCAGCTTGC 


CCTTGGAGGA 


7020 


GGCGCCGCAC 


GAGGGGCAGT 


GCAGACTTTT 


GAGGGCGTAG 


AGCTTGGGCG 


CGAGAAATAC 


7080 


CGATTCCGGG 


GAGTAGGCAT 


CCGCGCCGCA 


GGCCCCGCAG 


ACGGTCTCGC 


ATTCCACGAG 


7140 


CCAGGTGAGC 


TCTGGCCGTT 


CGGGGTCAAA 


AACCAGGTTT 


CCCCCATGCT 


TTTTGATGCG 


7200 


TTTCTTACCT 


CTGGTTTCCA 


TGAGCCGGTG 


TCCACGCTCG 


GTGACGAAAA 


GGCTGTCCGT 


7260 


GTCCCCGTAT 


ACAGACTTGA 


GAGGCCTGTC 


CTCGAGCGGT 


GTTCCGCGGT 


CCTCCTCGTA 


7320 


TAGAAACTCG 


GACCACTCTG 


AGACAAAGGC 


TCGCGTCCAG 


GCCAGCACGA 


AGGAGGCTAA 


7380 
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GTGGGAGGGG 


TAGCGGTPGT 


THTPPZlPTAP 


GGGGTCCACT CGCTCCAGGG 


TPTP A A P T\ TV 

1 G I G AAGACA 


7440 


CATGTCGCCC 


TCTTCGGPAT 


PH AP.P. Il BPPT 
\_AALsoAAo\j I 


GATTGGTTTG 


TAGGTGTAGG 


PPAPPTP7\ P^ 

GLALGTGACC 


7500 


GGGTGTTCCT 


GAAGGGGP^GP 


TIVT7i!\ R Appp 
1 A 1 AAAAoVjo 


GGTGGGGGCG 


CGTTCGTCCT 


P A PTPIf^HW/^ 

LAGTCTCTTC 


7560 


CGCATCGCTG 


TCTGCGAGGG 


rm^pTPTTp 


GGGTGAGTAC 


TCCCTCTGAA 


A Appppor^ikn 1 * 

AAGLGGGCAT 


7620 


GACTTCTGCG 


CTAAGATTGT 


P7iP.TTTPPii n 

v-Au 111 LLAH 


AAACGAGGAG 


GATTTPIiTZiT 

i. J. 1 L3>\ 1 A 1 


TCACCTGGCC 


7680 


CGCGGTGATG 


V- L_ A X X OALauLa 


'PPP/^^O 7\ rp /"^ 

lobLLGCATC 


CATCTGGTCA 


fiAAAAPZiPZiZi 
Ortrt/trtLa A L AA 


TCTTTTTGTT 


7740 


GTCAAGCTTG 


PTPPP a. a a rr 


ACCCGTAGAG 


GGCGTTGGAP 


A PIP A A P TTP P 
ALjLAAL. I 1 GG 


CGATGGAGCG 


7800 


CAGGGTTTGG 


TTTTTPTPPP 
i 1 a x 1 La l L.uL_ 


GATCGGCGCG 


CTCCTTGGPP 


P PP 7A TP TTT A 
LaL-LaA X La 1 I I A 


GCTGCACGTA 


7860 


TTCGCGCGPA 




ATTCGGGAAA 


GACGGTGGTG 


LGG 1 GG X LGG 


GCACCAGGTG 


7920 


CACGCGPPAA 


LLLaLLaLai IGx 


GCAGGGTGAC 


AAGGTPAAPP, 


pTf PTP pr"TTl 

L. 1GG I GGLTA 


CCTCTCCGCG 


7980 


TAGGCGPTPG 


TTPPTPP a pr 
A A LjLa J. LLAul 


AGAGGCGGCC 


GCCCTTGCGP 


PZiPIPAPA A TP 
LaALaLAGAA 1 G 


GCGGTAGGGG 


8040 


GTCTAPPTPP 


P,TPTPPTPPP 
La a L 1 GG 1 LLG 


GGGGGTCTGC 


GTCCACGfiTA 

^ 1 WWrtWUL3 1 


Zi ap a r^r* fT^rr* 

AAGAGGCCGG 


GCAGCAGGCG 


8100 


CGCGTCGAAG 

WWW\J J V\J/V1\J 


TAPTPTZiTP'T 
A A La 1 LI 


I GCATCCTTG 


CAAGTPTAnP 

v-rtr4V3 X W X J^L3L_ 


PPPTP PTP P P 

GGG IGL7GCC 


ATGCGCGGGC 


8160 


GGCAAGCGPP 


PPPTPPTJ\TP 
L.LaL- J. LG 1 A I G 


GGTTGAGTGG 




LaGGAl GGGGT 


GGGTGAGCGC 


8220 


GGAGGPGTAP 


M i LaGLGLAAA 


TGTCGTAAAC 


GTAP.AnP.P.P.P 


TPPTPTPP TV /""T" Tl 

lLiLi GAGTA 


TTCCAAGATA 


8280 




P A TPTTPP A P 


CGCGGATGCT 


P.f^PPPP.PZiPP 


1 AATCGTATA 


GTTCGTGCGA 


8340 


GGG7XGCG UPP 


a nr* tt* it* r* a p 
AGG 1 LbbbAL 


CGAGGTTGCT 


AL.0LjL3L.GGGw 


TGCTCTGCTC 


GGAAGACTAT 


8400 


ptppp^pi ap 


A TP p p * rpp T>P 

A i. GGGATGTG 


AGTTGGATGA 


TZlTP^P-TTPP A 
1 A 1 LjLa 1 1 GGA 


CGCTGGAAGA 


CGTTGAAGCT 


8460 


pppptptptp 


A P A P ottv fr*r* 

AGACCTACCG 


CGTCACGCAC 


PUZlPPZiPPPP 
Ls/irt L5 La AVj La L. La 


TAGGAGTCGC 


GC AG CTTGTT 


8520 


Unt^/iow i L La 


GLGGTGACCT 


GCACGTCTAG 


PP PP P Zi PT Ti P 
LruLuLHu 1 AG 


I CCAGGGTTT 


CCTTGATGAT 


8580 




TPPTPTPrP r P 


TTTTTTTCCA 




TTP 7\ P P T\ T\ Ti 

1 x G AGG AC AA 


ACTCTTCGCG 


8640 


GTPT'T'TPP AP 


T a ptpttpp a 
a AL ILI 1GGA 


TCGGAAACCC 


GTCGGCCTCC 


Cl A A PPP T A AP 

LaAALGG i AAG 


AGCCTAGCAT 


8700 


GTAGAAPTPP, 


Ttp hpppprT 


GGTAGGCGCA 


GCATPPPTTT 


TPTTiPPPPTA 
1L1 ALLaGG I A 


GCGCGTATGC 


8760 


CTGCGCGGPP 


TTPPPP A P PP 


AGGTGTGGGT 


GAGCGCAAAG 


PTPTPPPTPZi 
uluiLLLl La A 


CCATGACTTT 


8820 


GAGGTACTGG 


TUTTTP ?A apt 
a A I X 1 LAAG I 


LAGTGTCGTC 


GCATCCGCCC 


TGPTPPPAPZi 
auli LLLnun 


GCAAAAAGTC 


8880 


CGTGCGCTTT 


TTPn A 7\ r»P PP 


GA1 1 TGGCAG 


GG CGAAGG TG 


APATPPTTPZi 
x wo x x On 


AGAGTATCTT 


8940 


TCCCGCGCGA 


nf5P AT7i Zi ZiPT 
*JOLrt 1 AAA La 1 


TP PP TPTT" T* T> 

1 bLGTGTGAT 


GCGGAAGGGT 


^LLUUL/iLL 1 


CGGAACGGTT 


9000 


GTTAATTACC 


TGGGmnm a 

i OVJO\_\JV3\-OA 


PP7iPPT\ 'PPT'P 


GTCAAAGCCG 


TTGATPTTPT 


GGCCCACAAT 


9060 


GTAAAGTTCP 


A AP.Z1 7APPPPP 
AAL3 AAVj L» 0 L, O 


PP 7\ T^P P/*^^^m 

GGATGCCCTT 


GATGGAAGGP 


A ATTTTTTZi Zi 
rtrt I 1 X X I i AA 


GTTCCTCGTA 


9120 


GGTGAGCTCT 


TP Zi p-p^n lipp 


I GAGCCCGTG 


CTCTGAAAGG 


PPPPAPTPTP 

ULLLttU 1 L 1 u 


CAAGATGAGG 


9180 


GTTGPA appp 


AL (a AA I (j AGC 


TCCACAGGTC 


Zi PP PPPP7\ TT 

AL.GGGLLAI I 


AGCATTTGCA 


GGTGGTCGCG 


9240 


AAAGGTCCTA 


A ZiPTPPPPfi c 
aHL X ULjUoAL 


GTATGGCCAT 


TTTTTCTGGG 


GTGATGCAGT 


AGAAGGTAAG 


9300 


PPPPT^TTPT 


TCCCAGCGGT 


CCCATCCAAG 


GTTCGCGGCT 


AGGTCTCGCG 


CGGCAGTCAC 


9360 


TAGAGGCTCA 


TCTCCGCCGA 


ACTTCATGAC 


CAGCATGAAG 


GGCACGAGCT 


GCTTCCCAAA 


9420 


GGCCCCCATC 


CAAGTATAGG 


TCTCTACATC 


GTAGGTGACA 


AAGAGACGCT 


CGGTGCGAGG 


9480 
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ATGCGAGCCG 


ATCGGGAAGA 


ACTGGATCTC 


GTGGTGAAAG 


TAGAAGTCCC 


TGCGACGGGC 


TGCGCAGTAC 


TGGCAGCGGT 


GCACGGGCTG 


GCGCACAAGG 


AAGCAGAGTG 


GGAATTTGAG 


TTCTACTTCG 


GCTGCTTGTC 


CTTGACCGTC 


GACCACCACG 


CCGCGCGAGC 


CCAAAGTCCA 


GACAACATCG 


CGCAGATGGG 


AGCTGTCCAT 


CGGGAGCTCC 


TGCAGGTTTA 


CCTCGCATAG 


ATACGTAATT 


TCCAGGGGCT 


GGTTGGTGGC 


CCGCGGCGCG 


ACTACGGTAC 


CGCGCGGCGG 


TGCATCTAAA 


AGCGGTGACG 


CGGGCGAGCC 


GGGAGAGGGG 


GCAGGGGCAC 


GTCGGCGCCG 


AGGTTGCTGG 


CGAACGCGAC 


GACGCGGCGG 


AAGACGACGG 


GCCCGGTGAG 


CTTGAGCCTG 


TCGTTGACGG 


CGGCCTGGCG 


CAAAATCTCC 


ATCTCGGCCA 


TGAACTGCTC 


GATCTCTTCC 


ACGGTGGCGG 


CGAGGTCGTT 


GGAAATGCGG 


CCCTCGTTCC 


AGACGCGGCT 


GTAGACCACG 


ACCTGCGCGA 


GATTGAGCTC 


CACGTGCCGG 


AAGAGGTAGT 


TGAGGGTGGT 


GGCGGTGTGT 


CGCAACGTGG 


ATTCGTTGAT 


ATCCCCCAAG 


TCCACGGCGA 


AGTTGAAAAA 


CTGGGAGTTG 


AGACGGATGA 


GCTCGGCGAC 


AGTGTCGCGC 


TCTTCTTCTT 


CAATCTCCTC 


TTCCATAAGG 


GGGGGAGGGG 


GGACACGGCG 


GCGACGACGG 


ATCATCTCCC 


CGCGGCGACG 


GCGCATGGTC 


CGCAGTTGGA 


AGACGCCGCC 


CGTCATGTCC 


GGCAGGGATA 


CGGCGCTAAC 


GATGCATCTC 


AGGGACCTGA 


GCGAGTCCGC 


ATCGACCGGA 


CAGTCACAGT 


CGCAAGGTAG 


GCTGAGCACC 


TTGTTTCTGG 


CGGAGGTGCT 


GCTGATGATG 


ATGGTCGACA 


GAAGCACCAT 


GTCCTTGGGT 


ATGCCCCAGG 


CTTCGTTTTG 


ACATCGGCGC 


TCTACCGGCA 


CTTCTTCTTC 


TCCTTCCTCT 


GCGGCGGCGG 


AGTTTGGCCG 


TAGGTGGCGC 
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CCGCCACCAA 


TTGGAGGAGT 


GGCTATTGAT 


9540 


CGAACACTCG 


TGCTGGCTTT 


TGTAAAAACG 


*9600 


TACATCCTGC 


ACGAGGTTGA 


CCTGACGACC 


9660 


CCCCTCGCCT 


GGCGGGTTTG 


GCTGGTGGTC 


9720 


TGGCTGCTCG 


AGGGGAGTTA 


CGGTGGATCG 


9780 


GATGTCCGCG 


CGCGGCGGTC 


GGAGCTTGAT 


9840 


GGTCTGGAGC 


TCCCGCGGCG 


TCAGGTCAGG 


9900 


ACGGGTCAGG 


GCGCGGGCTA 


GATCCAGGTG 


9960 


GGCGTCGATG 


GCTTGCAAGA 


GGCCGCATCC 


10020 


GCGGTGGGCC 


GCGGGGGTGT 


CCTTGGATGA 


10080 


CCCGGAGGTA 


GGGGGGGCTC 


CGGACCCGCC 


10140 


CGCGCGGGCA 


GGAGCTGGTG 


CTGCGCGCGT 


10200 


TTGATCTCCT 


GAATCTGGCG 


CCTCTGCGTG 


10260 


AAAGAGAGTT 


CGACAGAATC 


AATTTCGGTG 


10320 


TGCACGTCTC 


CTGAGTTGTC 


TTGATAGGCG 


10380 


TCCTGGAGAT 


CTCCGCGTCC 


GGCTCGCTCC 


10440 


GCCATGAGCT 


GCGAGAAGGC 


GTTGAGGCCT 


10500 


CCCCCTTCGG 


CATCGCGGGC 


GCGCATGACC 


10560 


GCGAAGACGG 


CGTAGTTTCG 


CAGGCGCTGA 


10620 


TCTGCCACGA 


AGAAGTACAT 


AACCCAGCGT 


10680 


GCCTCAAGGC 


GCTCCATGGC 


CTCGTAGAAG 


10740 


CGCGCCGACA 


CGGTTAACTC 


CTCCTCCAGA 


10800 


ACCTCGCGCT 


CAAAGGCTAC 


AGGGGCCTCT 


10860 


GCCTCCCCTT 


CTTCTTCTTC 


TGGCGGCGGT 


10920 


CGCACCGGGA 


GGCGGTCGAC 


AAAGCGCTCG 


10980 


TCGGTGACGG 


CGCGGCCGTT 


CTCGCGGGGG 


11040 


CGGTTATGGG 


TTGGCGGGGG 


GCTGCCATGC 


11100 


AACAATTGTT 


GTGTAGGTAC 


TCCGCCGCCG 


11160 


TCGGAAAACC 


TCTCGAGAAA 


GGCGTCTAAC 


11220 


GTGGCGGGCG 


GCAGCGGGCG 


GCGGTCGGGG 


11280 


TAATTAAAGT 


AGGCGGTCTT 


GAGACGGCGG 


11340 


CCGGCCTGCT 


GAATGCGCAG 


GCGGTCGGCC 


11400 


AGGTCTTTGT 


AGTAGTCTTG 


CATGAGCCTT 


11460 


TGTCCTGCAT 


CTCTTGCATC 


TATCGCTGCG 


11520 


CCTCTTCCTC 


CCATGCGTGT 


GACCCCGAAG 


11580 
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CCCCTCATCG GCTGAAGCAG GGCTAGGTCG GCGACAACGC GCTCGGCTAA TATGGCCTGC^^^^ 



TGCACC7GCG TGAGGGTAGA CTGGAAGTCA TCCATGTCCA CAAAGCGGTG GTATGCGCCT 
GTGTTGATGG TGTAAGTGCA GTTGGCCATA ACGGACCAGT TAACGGTCTG GTGACCCGGC 
TGCGAGAGCT CGGTGTACCT GAGACGCGAG TAAGCCCTCG AGTCAAATAC GTAGTCGTTG 
CAAGTCCGCA CCAGGTACTG GTATCCCACC AAAAAGTGCG GCGGCGGCTG GCGGTAGAGG 
GGCCAGCGTA GGGTGGCCGG GGCTCCGGGG GCGAGATCTT CCAACATAAG GCGATGATAT 
CCGTAGATGT ACCTGGACAT CCAGGTGATG CCGGCGGCGG TGGTGGAGGC GCGCGGAAAG 
TCGCGGACGC GGTTCCAGAT GTTGCGCAGC GGCAAAAAGT GCTCCATGGT CGGGACGCTC 
TGGCCGGTCA GGCGCGCGCA ATCGTTGACG CTCTACCGTG CAAAAGGAGA GCCTGTAAGC 
GGGCACTCTT CCGTGGTCTG GTGGATAAAT TCGCAAGGGT ATCATGGCGG ACGACCGGGG 
TTCGAGCCCC GTATCCGGCC GTCCGCCGTG ATCCATGCGG TTACCGCCCG CGTGTCGAAC 
CCAGGTGTGC GACGTCAGAC AACGGGGGAG TGCTCCTTTT GGCTTCCTTC CAGGCGCGGC 
GGCTGCTGCG CTAGCTTTTT TGGCCACTGG CCGCGCGCAG CGTAAGCGGT TAGGCTGGAA 
AGCGAAAGCA TTAAGTGGCT CGCTCCCTGT AGCCGGAGGG TTATTTTCCA AGGGTTGAG"" 
CGCGGGACCC CCGGTTCGAG TCTCGGACCG GCCGGACTGC GGCGAACGGG GGTTTGCCTC 
CCCGTCATGC AAGACCCCGC TTGCAAATTC CTCCGGAAAC AGGGACGAGC CCCTTTTTTG 
CTTTTCCCAG ATGCATCCGG TGCTGCGGCA GATGCGCCCC CCTCCTCAGC AGCGGCAAGA 
GCAAGAGCAG CGGCAGACAT GCAGGCCACC CTCCCCTCCT CCTACCGCGT CAGGAGGGGC 
GACATCCGCG GTTGACGCGG CAGCAGATGG TGATTACGAA CCCCCGCGGC GCCGGGCCCG 
GCACTACCTG GACTTGGAGG AGGGCGAGGG CCTGGCGCGG CTAGGAGCGC CCTCTCCTGA 
GCGGTACCCA AGGGTGCAGC TGAAGCGTGA TACGCGTGAG GCGTACGTGC' CGCGGCAGAA 
CCTGTTTCGC GACCGCGAGG GAGAGGAGCC CGAGGAGATG CGGGATCGAA AGTTCCACGC 
AGGGCGCGAG CTGCGGCATG GCCTGAATCG CGAGCGGTTG CTGCGCGAGG AGGACTTTGA 
GCCCGACGCG CGAACCGGGA TTAGTCCCGC GCGCGCACAC GTGGCGGCCG CCGACCTGGT 
AACCGCATAC GAGCAGACGG TGAACCAGGA GATTAACTTT CAAAAAAGCT TTAACAACCA 13 0B0 
CGTGCGTACG CTTGTGGCGC GCGAGGAGGT GGCTATAGGA CTGATGCATC TGTGGGACTT X3140 
TGTAAGCGCG CTGGAGCAAA ACCCAAATAG CAAGCCGCTC ATGGCGCAGC TGTTCCTTAT 
AGTGCAGCAC AGCAGGGACA ACGAGGCATT CAGGGATGCG CTGCTAAACA TAGTAGAGCC 
CGAGGGCCGC TGGCTGCTCG ATTTGATAAA CATCCTGCAG AGCATAGTGG TGCAGGAGCG 
CAGCTTGAGC CTGGCTGACA AGGTGGCCGC CATCAACTAT TCCATGCTTA GCCTGGGCAA 
GTTTTACGCC CGCAAGATAT ACCATACCCC TTACGTTCCC ATAGACAAGG AGGTAAAGAT 
CGAGGGGTTC TACATGCGCA TGGCGCTGAA GGTGCTTACC TTGAGCGACG ACCTGGGCGT 
TTATCGCAAC GAGCGCATCC ACAAGGCCGT GAGCGTGAGC CGGCGGCGCG AGCTCAGCGA 13 560 
CCGCGAGCTG ATGCACAGCC TGCAAAGGGC CCTGGCTGGC ACGGGCAGCG GCGATAGAGA 13620 
GGCCGAGTCC TACTTTGACG CGGGCGCTGA CCTGCGCTGG GCCCCAAGCC GACGCGCCCT U680 



11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
12240 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
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13020 
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GGAGGCAGCT 


GGGGCCGGAC 


CTGGGCTGGC 


CGGCGTGGAG 


GAATATGACG 


AGGACGATGA 


GGTGATGTTT 


CTGATCAGAT 


GATGCAAGAC 


CAGAGCCAGC 


CGTCCGGCCT 


TAACTCCACG 


ATGTCGCTGA 


CTGCGCGCAA 


TCCTGACGCG 


TCCGCAATTC 


TGGAAGCGGT 


GGTCCCGGCG 


GCGATCGTAA 


ACGCGCTGGC 


CGAAAACAGG 


TACGACGCGC 


TGCTTCAGCG 


CGTGGCTCGT 


GACCGGCTGG 


TGGGGGATGT 


GCGCGAGGCC 


GGCAACCTGG 


GCTCCATGGT 


TGCACTAAAC 


CCGCGGGGAC 


AGGAGGACTA 


CACCAACTTT 


ACACCGCAAA 


GTGAGGTGTA 


CCAGTCTGGG 


GGCCTGCAGA 


CCGTAAACCT 


GAGCCAGGCT 


CGGGCTCCCA 


CAGGCGACCG 


CGCGACCGTG 


TTGCTGCTGC 


TAATAGCGCC 


CTTCACGGAC 


GGTCACTTGC 


TGACACTGTA 


CCGCGAGGCC 


TTCCAGGAGA 


TTACAAGTGT 


CAGCCGCGCG 


GCAACCCTAA 


ACTACCTGCT 


GACCAACCGG 


AACAGCGAGG 


AGGAGCGCAT 


TTTGCGCTAC 


CGCGACGGGG 


TAACGCCCAG 


CGTGGCGCTG 


ATGTATGCCT 


CAAACCGGCC 


GTTTATCAAC 


GCCGTGAACC 


CCGAGTATTT 


CACCAATGCC 


GGTTTCTACA 


CCGGGGGATT 


CGAGGTGCCC 


ATAGACGACA 


GCGTGTTTTC 


CCCGCAACCG 


CAGGCAGAGG 


CGGCGCTGCG 


AAAGGAAAGC 


GGCGCTGCGG 


CCCCGCGGTC 


AGATGCTAGT 


ACCAGCACTC 


GCACCACCCG 


CCCGCGCCTG 


CTGCTGCAGC 


CGCAGCGCGA 


AAAAAACCTG 


AGCCTAGTGG 


ACAAGATGAG 


TAGATGGAAG 


GGCCCGCGCC 


CGCCCACCCG 


TCGTCAAAGG 


GACGATGACT 


CGGCAGACGA 


CAGCAGCGTC 


GCGCACCTTC 


GCCCCAGGCT 


GGGGAGAATG 


AAAAAACTCA 


CCAAGGCCAT 


GGCACCGAGC 


GGCGCGCGGC 


GATGTATGAG 


GAAGGTCCTC 


CGCCAGTGGC 


GGCGGCGCTG 


GGTTCTCCCT 
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GGTGGCACCC GCGCGCGC7G GCAACGTCGG 13740 

GTACGAGCCA GAGGACGGCG AGTACTAAGC 13800 

GCAACGGACC CGGCGGTGCG GGCGGCGCTG 13 86 0 

GACGACTGGC GCCAGGTCAT GGACCGCATC 13 920 

TTCCGGCAGC AGCCGCAGGC CAACCGGCTC 13980 

CGCGCAAACC CCACGCACGA GAAGGTGCTG 14 040 

GCCATCCGGC CCGACGAGGC CGGCCTGGTC 14100 

TACAACAGCG GCAACGTGCA GACCAACCTG 14160 

GTGGCGCAGC GTGAGCGCGC GCAGCAGCAG 14220 

GCCTTCCTGA GTACACAGCC CGCCAACGTG 142 8 0 

GTGAGCGCAC TGCGGCTAAT GGTGACTGAG 14 34 0 

CCAGACTATT TTTTCCAGAC CAGTAGACAA 14400 

TTCAAAAACT TGCAGGGGCT GTGGGGGGTG 144 60 

TCTAGCTTGC TGACGCCCAA CTCGCGCCTG 14520 

AGTGGCAGCG TGTCCCGGGA CACATACCTA 14580 

ATAGGTCAGG CGCATGTGGA CGAGCATACT 14640 

CTGGGGCAGG AGGACACGGG CAGCCTGGAG 14 700 

CGGCAGAAGA TCCCCTCGTT GCACAGTTTA 14760 

GTGCAGCAGA GCGTGAGCCT TAACCTGATG 14 820 

GACATGACCG CGCGCAACAT GGAACCGGGC 14 8 80 

CGCCTAATGG ACTACTTGCA TCGCGCGGCC 14 94 0 

ATCTTGAACC CGCACTGGCT ACCGCCCCCT 15000 

GAGGGTAACG ATGGATTCCT CTGGGACGAC 15060 

CAGACCCTGC TAGAGTTGCA ACAGCGCGAG 15120 

TTCCGCAGGC CAAGCAGCTT GTCCGATCTA 1518 0 

AGCCCATTTC CAAGCTTGAT AGGGTCTCTT 15240 

CTGGGCGAGG AGGAGTACCT AAACAACTCG 153 00 

CCTCCGGCAT TTCCCAACAA CGGGATAGAG 15360 

ACGTACGCGC AGGAGCACAG GGACGTGCCA 15420 

CACGACCGTC AGCGGGGTCT GGTGTGGGAG 15480 

CTGGATTTGG GAGGGAGTGG CAACCCGTTT 1554 0 

TTTTAAAAAA AAAAAAGCAT GATGCAAAAT 15600 

GTTGGTTTTC TTGTATTCCC CTTAGTATGC 15660 

CTCCCTCCTA CGAGAGTGTG GTGAGCGCGG 15720 

TCGATGCTCC CCTGGACCCG CCGTTTGTGC 15780 
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CTCCGCGGTA CCTGCGGCCT ACCGGGGGGA GAAACAGCAT CCGTTACTCT GAGTTGGCAC 15840 

CCCTATTCGA CACCACCCGT GTGTACCTGG TGGACAACAA GTCAACGGAT GTGGCATCCC 15900 

TGAACTACCA GAACGACCAC AGCAACTTTC TGACCACGGT CATTCAAAAC AATGACTACA 15960 

GCCCGGGGGA GGCAAGCACA CAGACCATCA ATCTTGACGA CCGGTCGCAC TGGGGCGGCG 16020 

ACCTGAAAAC CATC CTG CAT ACCAACATGC CAAATGTGAA CGAGTTCATG TTTACCAATA 16080 

AGTTTAAGGC GCGGGTGATG GTGTCGCGCT TGCCTACTAA GGACAATCAG GTGGAGCTGA 16140 

AATACGAGTG GGTGGAGTTC ACGCTGCCCG AGGGCAACTA CTCCGAGACC ATGACCATAG 16200 

ACCTTATGAA CAACGCGATC GTGGAGCACT ACTTGAAAGT GGGCAGACAG AACGGGGTTC 16260 

TGGAAAGCGA CATCGGGGTA AAGTTTGACA CCCGCAACTT CAGACTGGGG TTTGACCCCG 16320 

TCACTGGTCT TGTCATGCCT GGGGTATATA CAAACGAAGC CTTCCATCCA GACATCATTT 16380 

TGCTGCCAGG ATGCGGGGTG GACTTCACCC ACAGCCGCCT GAGCAACTTG TTGGGCATCC 1644 0 

GCAAGCGGCA ACCCTTCCAG GAGGGCTTTA GGATCACCTA CGATGATCTG GAGGGTGGTA 16500 

ACATTCCCGC ACTGTTGGAT GTGGACGCCT ACCAGGCGAG CTTGAAAGAT GACACCGAAC 16560 

AGGGCGGGGG TGGCGCAGGC GGCAGCAACA GCAGTGGCAG CGGCGCGGAA GAGAACTCCA 16620 

ACGCGGCAGC CGCGGCAATG CAGCCGGTGG AGGACATGAA CGATCATGCC ATTCGCGGCG 16680 

ACACCTTTGC CACACGGGCT GAGGAGAAGC GCGCTGAGGC CGAAGCAGCG GCCGAAGCTG 16740 

CCGCCCCCGC TGCGCAACCC GAGGTCGAGA AGCCTCAGAA GAAACCGGTG ATCAAACCCC 16800 

TGACAGAGGA CAGCAAGAAA CGCAGTTACA ACCTAATAAG CAATGACAGC ACCTTCACCC 16860 

AGTACCGCAG CTGGTACCTT GCATACAACT ACGGCGACCC TCAGACCGGA ATCCGCTCAT 16920 

GGACCCTGCT TTGCACTCCT GACGTAACCT GCGGCTCGGA GCAGGTCTAC TGGTCGTTGC 16980 

CAGACATGAT GCAAGACCCC GTGACCTTCC GCTCCACGCG CCAGATCAGC AACTTTCCGG 17040 

TGGTGGGCGC CGAGCTGTTG CCCGTGCACT CCAAGAGCTT CTACAACGAC CAGGCCGTCT 17100 

ACTCCCAACT CATCCGCCAG TTTACCTCTC TGACCCACGT GTTCAATCGC TTTCCCGAGA 17160 

ACCAGATTTT GGCGCGCCCG CCAGCCCCCA CCATCACCAC CGTCAGTGAA AACGTTCCTG 17220 

CTCTCACAGA TCACGGGACG CTACCGCTGC GCAACAGCAT CGGAGGAGTC CAGCGAGTGA 17280 

CCATTACTGA CGCCAGACGC CGCACCTGCC CCTACGTTTA CAAGGCCCTG GGCATAGTCT 17340 

CGCCGCGCGT CCTATCGAGC CGCACTTTTT GAGCAAGCAT GTCCATCCTT ATATCGCCCA 17400 

GCAATAACAC AGGCTGGGGC CTGCGCTTCC CAAGCAAGAT GTTTGGCGGG GCCAAGAAGC 17460 

GCTCCGACCA ACACCCAGTG CGCGTGCGCG GGCACTACCG CGCGCCCTGG GGCGCGCACA 17 520 

AACGCGGCCG CACTGGGCGC ACCACCGTCG ATGACGCCAT CGACGCGGTG GTGGAGGAGG 17580 

CGCGCAACTA CACGCCCACG CCGCCACCAG TGTCCACAGT GGACGCGGCC ATTCAGACCG 17640 

TGGTGCGCGG AGCCCGGCGC TATGCTAAAA TGAAGAGACG GCGGAGGCGC GTAGCACGTC 17700 

GCCACCGCCG CCGACCCGGC ACTGCCGCCC AACGCGCGGC GGCGGCCCTG CTTAACCGCG 17760 

CACGTCGCAC CGGCCGACGG GCGGCCATGC GGGCCGCTCG AAGGCTGGCC GCGGGTATTG 17820 

TCACTGTGCC CCCCAGGTCC AGGCGACGAG CGGCCGCCGC AGCAGCCGCG GCCATTAGTG 17880 
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CTATGACTCA 


GGGTCGCAGG 


GGCAACGTGT 


GCGTGCCCGT 


GCGCACCCGC 


CCCCCGCGCA 


CGTACTGTTG 


TATGTATCCA 


GCGGCGGCGG 


TCAAAGAAGA 


GATGCTCCAG 


GTCATCGCGC 


AGCAGGATTA 


CAAGCCCCGA 


AAGCTAAAGC 


ATGAACTTGA 


CGACGAGGTG 


GAACTGCTGC 


GGAAAGGTCG 


ACGCGTAAAA 


CGTGTTTTGC 


GTGAGCGCTC 


CACCCGCACC 


TACAAGCGCG 


TGCTTGAGCA 


GGCCAACGAG 


CGCCTCGGGG 


TGCTGGCGTT 


GCCGCTGGAC 


GAGGGCAACC 


AGCAGGTGCT 


GCCCGCGCTT 


GCACCGTCCG 


GTGACTTGGC 


ACCCACCGTG 


CAGCTGATGG 


TGGAAAAAAT 


GACCGTGGAA 


CCTGGGCTGG 


AGGTGGCGCC 


GGGACTGGGC 


GTGCAGACCG 


CCAGTATTGC 


CACCGCCACA 


GAGGGCATGG 


TGGCGGATGC 


CGCGGTGCAG 


GCGGTCGCTG 


AAACGGACCC 


GTGGATGTTT 


CGCGTTTCAG 


ACGGCGCCGC 


CAGCGCGCTA 


CTGCCCGAAT 


CCGGCTATCG 


TGGCTACACC 


TACCGCCCCA 


CCACTGGAAC 


CCGCCGCCGC 


CGTCGCCGTC 


GCAGGGTGGC 


TCGCGAAGGA 


GGCAGGACCC 


GCATCGTTTA 


AAAGCCGGTC 


TTTGTGGTTC 


GTTTCCCGGT 


GCCGGGATTC 


CGAGGAAGAA 


GCCTGACGGG 


CGGCATGCGT 


CGTGCGCACC 


TGCGCGGCGG 


TATCCTGCCC 


CTCCTTATTC 


CCGGAATTGC 


ATCCGTGGCC 


TTGCAGGCGC 


GGAAAAATCA 


AAATAAAAAG 


TCTGGACTCT 


AGAATGGAAG 


ACATCAACTT 


TGCGTCTCTG 


GGAAACTGGC 


AAGATATCGG 


CACCAGCAAT 


CTGTGGAGCG 


GCATTAAAAA 


TTTCGGTTCC 


AACAGCAGCA 


CAGGCCAGAT 


GCTGAGGGAT 


GTGGTAGATG 


GCCTGGCCTC 


TGGCATTAGC 


CAAAATAAGA 


TTAACAGTAA 


GCTTGATCCC 


GTGGAGACAG 


TGTCTCCAGA 


GGGGCGTGGC 


ACTCTGGTGA 


CGCAAATAGA 


CGAGCCTCCC 
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ATTGGGTGCG CGACTCGGTT AGCGGCCTGC 17 940 

ACTAGATTGC AAGAAAAAAC TACTTAGACT 18 000 

CGCGCAACGA AGCTATGTCC AAGCGCAAAA 18 060 

CGGAGATCTA TGGCCCCCCG AAGAAGGAAG 1812 0 

GGGTCAAAAA GAAAAAGAAA GATGATGATG 1818 0 

ACGCTACCGC GCCCAGGCGA CGGGTACAGT 18240 

GACCCGGCAC CACCGTAGTC TTTACGCCCG 18 300 

TGTATGATGA GGTGTACGGC GACGAGGACC 18 360 

AGTTTGCCTA CGGAAAGCGG CATAAGGACA 18420 

CAACACCTAG CCTAAAGCCC GTAACACTGC 18 480 

AAGAAAAGCG CGGCCTAAAG CGCGAGTCTG 18 540 

TACCCAAGCG CCAGCGACTG GAAGATGTCT 18 600 

AGCCCGAGGT CCGCGTGCGG CCAATCAAGC 18 660 

TGGACGTTCA GATACCCACT ACCAGTAGCA 18 720 

AGACACAAAC GTCCCCGGTT GCCTCAGCGG 18 780 

CGGCCGCGTC CAAGACCTCT ACGGAGGTGC 18 840 

CCCCCCGGCG CCCGCGCGGT TCGAGGAAGT 18 900 

ATGCCCTACA TCCTTCCATT GCGCCTACCC 18 960 

GAAGACGAGC AACTACCCGA CGCCGAACCA 19020 

GCCAGCCCGT GCTGGCCCCG ATTTCCGTGC 19080 

TGGTGCTGCC AACAGCGCGC TACCACCCCA 1914 0 

TTGCAGATAT GGCCCTCACC TGCCGCCTCC 19200 

TGCACCGTAG GAGGGGCATG GCCGGCCACG 19260 

ACCGGCGGCG GCGCGCGTCG CACCGTCGCA 19320 

CACTGATCGC CGCGGCGATT GGCGCCGTGC 19 38 0 

AGAGACACTG ATTAAAAACA AGTTGCATGT 1944 0 

CACGCTCGCT TGGTCCTGTA ACTATTTTGT 19 500 

GCCCCGCGAC ACGGCTCGCG CCCGTTCATG 19560 

ATGAGCGGTG GCGCCTTCAG CTGGGGCTCG 19620 

ACCGTTAAGA ACTATGGCAG CAAGGCCTGG 19680 

AAGTTGAAAG AGCAAAATTT CCAACAAAAG 19740 

GGGGTGGTGG ACCTGGCCAA CCAGGCAGTG 19800 

CGCCCTCCCG TAGAGGAGCC TCCACCGGCC 19860 

GAAAAGCGTC CGCGCCCCGA CAGGGAAGAA 19920 

TCGTACGAGG AGGCACTAAA GCAAGGCCTG 19980 
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CCCACCACCC 


GTCCCATPGP 


GPPP ^ TGPPT 
Uv-v.v.ni uuL 1 


t r*r*r*r* t\ ptpp 
AL LbbAbTGC 


TGGGCCAGCA 


PCT/US97/1954] 

CACACCCGTA 20040 


ACGCTGGACC 


TGCCTPPPPP 


PGPPPTaP TaPP 
v.\jv.HjMv,AH. 


PTVPPTVPTV TV TVP 

CAGCAGAAAC 


CTGTGCTGCC 


AGGCCCGACC 


20100 


GCCGTTGTTG 


TAAPPPGTPP 


t 21 pppnr*ppp 


1 LLLTGCGCC 


GCGCCGCCAG 


CGGTCCGCGA 


20160 


TCGTTGCGGC 


CCGTAGPPAG 


TGP.PZ1 UPTPP 


PTVTV APPTVPTVP 

LAAAGLALAC 


TGAACAGCAT 


CGTGGGTCTG 


20220 


GGGGTGCAAT 


CCCTGAAGPG 


PPH A PP 7A TPP 


TTPTP TV TV fft P 

TTCTGAATAG 


CTAACGTGTC 


GTATGTGTGT 


20280 


CATGTATGCG 


TCPATGTPGP 


PPPPAP7A PPTv 
\_LjL LAIjAvoVjA 


GCTGCTGAGC 


CGCCGCGCGC 


CCGCTTTCCA 


20340 


AGATGGCTAC 


PPPTTPP7ATP. 


A 1 (jLCbLAGT 


GGTCTTACAT 


GCACATCTCG 


GGCCAGGACG 


2 04 00 


CCTCGGAGTA 


CCTGAGPPPP 


cr.r.rTPPTPp 


AGTTTGCCCG 


CGCCACCGAG 


ACGTACTTCA 


20460 


GCCTGAATAA 


P A7A.GTTT7AP7A 
^-nnu ill AGA 


TV TVPPPPta /">/*»/*• 

AALLt_CACGG 


TGGCGCCTAC 


GCACGACGTG 


ACCACAGACC 


20520 


GGTCCPAGPG 


TTTG LPGPTP. 


Lljb 1 I CATCC 


CTGTGGACCG 


TGAGGATACT 


GCGTACTCGT 


20580 


ACAAGGPGPG 


GTTPHPPPT7A 
oil IMLLL 1 A 


p pfpTp p p^p 
bL lb J bGGTG 


ATAACCGTGT 


GCTGGACATG 


GCTTCCACGT 


20640 


APTTTGAPAT 


LLuLuuLu 1 o 


PTPP Srn^rn 

L IbGACAGGG 


GCCCTACTTT 


TAAGCCCTAC 


TCTGGCACTG 


20700 


CCTACAAPG P 


PPTPP.ptppp 


TV A PP PH*»P / - *P/*» 

AAGGGTGCCC 


CAAATCCTTG 


CGAATGGGAT 


GAAGCTGCTA 


20760 


CTGC^PTTG A 


7a 7a tta ta 7\ rr"VT\ 


(jAAGAAGAGG 


ACGATGACAA 


CGAAGACGAA 


GTAGACGAGC 




azigptpzip.P7a 


PP7\TVTVTVTVTt 

GLAAAAAACT 


CACGTATTTG 


GGCAGGCGCC 


TTATTCTGGT 


ATAAATATTA 




p A a a czn a rzczn 


T' Ta TTP TV TV 

I A TTL. AAATA 


GGTGTCGAAG 


GTCAAACACC 


TAAATATG CC 


GATAAAACAT 


2 094 0 


TTPA APPTP. A 


& P PT P T\ 7\ 7\ T 1 T\ 

ACL I L.AAATA 


GGAGAATCTC 


AGTGGTACGA 


AACTGAAATT 


AATCATGCAG 


21000 




fPTTft Tv TV Tv ftp 

LL1 I AAAAAG 


ACTACCCCAA 


TGAAACCATG 


TTACGGTTCA 


TATGCAAAAC 




PPAPA A 21 TP. 21 


AAA I GGAGGG 


CAAGGCATTC 


TTGTAAAGCA 


ACAAAATGGA 


AAGCTAGAAA 


2112 0 


G TP A Zl GTG P A 


7A 7\ TP P TV TV i"P' T" 1 1 

AA 1 GLAATTT 


TTCTCAACTA 


CTGAGGCGAC 


CGCAGGCAAT GGTGATAACT 


21180 


TGAPTPPTH 21 


7A C Tf fTft TTP 

AG 1 GG 1 ATTG 


TACAGTGAAG 


ATGTAGATAT 


AGAAACCCCA 


GACACTCATA 


21240 


TTTPTT A PUT 


pppp n rT7\ tt 


AAGGAAGGTA 


ACTCACGAGA 


ACTAATGGGC 


CAACAATCTA 


21300 


TGPPPAAPAG 


oL-L 1 AA1 I AL 


ATTGCTTTTA 


GGGACAATTT 


TATTGGTCTA 


ATGTATTACA 


21360 


ACAGCACGGG 


1 >»>i 1 A I uGG 1 


GTTCTGGCGG 


GCCAAGCATC 


GCAGTTGAAT 


GCTGTTGTAG 


21420 


ATTTGPAAGA 


P 7A f Tv 7\ 7\PTi P7\ 

GAGAAAGACA 


GAGCTTTCAT ACCAGCTTTT GCTTGATTCC 


ATTGGTGATA 


21480 


GAACPAGGTA 


PTTTTPT T\ TP 


TGGAATCAGG 


CTGTTGACAG 


CTATGATCCA 


GATGTTAGAA 


2154C 


TTATTGAAAA 


TPBTrr T\ 7\ PT 
I V_ A 1 IjLjAAL 1 


GAAGATGAAC 


TTCCAAATTA 


CTGCTTTCCA 


CTGGGAGGTG 


21600 


TGATTAATAP 


AGZiPTAPTPTT 


ACCAAGGTAA 


AACCTAAAAC 


AGGTCAGGAA 


AATGGATGGG 


21660 


AAAAAGATGP 


T 7A P 7i P TV TV T'TT 1 

1 ALAbAAT T T 


TCAGATAAAA 


ATGAAATAAG 


AGTTGGAAAT 


AATTTTGCCA 


21720 


TGGAAATPAA 


TPT7A Ti atppp 
1 L 1 AAA! (jLL 


AACCTGTGGA 


GAAATTTCCT 


GTACTCCAAC 


ATAGCGCTGT 


21780 


ATTTGCPPGA 


P7i 7APPTTA ta tap 
UMAtaL 1 AAAo 


TACAGTCCTT 


CCAACGTAAA 


AATTTCTGAT 


AACCCAAACA 


i ± 0 *i u 


CCTACGACTA 


P ATP. 21 Zl P 71 a P. 


CGAGTGGTGG 


CTCCCGGGTT 


AGTGGACTGC 


TACATTAACC 


21900 


TTGGAGCACG 


CTGGTCCCTT 


GACTATATGG 


ACAACGTCAA 


CCCATTTAAC 


CACCACCGCA 


21960 


ATGCTGGCCT 


GCGCTACCGC 


TCAATGTTGC 


TGGGCAATGG 


TCGCTATGTG 


CCCTTCCACA 


2202C 


TCCAGGTGCC 


TCAGAAGTTC 


TTTGCCATTA 


AAAACCTCCT 


TCTCCTGCCG 


GGCTCATACA 


22080 
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CCTACGAGTG 
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AAGGATGTTA 
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CTAGGAAATG 2214 0 


ACCTAAGGGT 


TGACGGAGCC 


AGCATTAAGT 


TTGATAGCAT 


TTGCCTTTAC 


GCCACCTTCT 


22200 


TCCCCATGGC 


CCACAACACC 


GCCTCCACGC 


TTGAGGCCAT 


GCTTAGAAAC 


GACACCAACG 


22260 


ACCAGTCCTT 


TAACGACTAT 


CTCTCCGCCG 


CCAACATGCT 


CTACCCTATA 


CCCGCCAACG 


22320 


CTACCAACGT 


GCCCATATCC 


ATCCCCTCCC 


GCAACTGGGC 


GGCTTTCCGC 


GGCTGGGCCT 


22380 


TCACGCGCCT 


TAAGACTAAG 


GAAACCCCAT 


CACTGGGCTC 


GGGCTACGAC 


CCTTATTACA 


22440 


CCTACTCTGG 


CTCTATACCC 


TACCTAGATG 


GAACCTTTTA 


CCTCAACCAC 


ACCTTTAAGA 


22500 


AGGTGGCCAT 


TACCTTTGAC 


TCTTCTGTCA 


GCTGGCCTGG 


CAATGACCGC 


CTGCTTACCC 


22560 


CCAACGAGTT 


TGAAATTAAG 


CGCTCAGTTG 


ACGGGGAGGG 


TTACAACGTT 


GCCCAGTGTA 


22620 


ACATGACCAA 


AGACTGGTTC 


CTGGTACAAA 


TGCTAGCTAA 


CTACAACATT 


GGCTACCAGG 


22680 


GCTTCTATAT 


CCCAGAGAGC 


TACAAGGACC 


GCATGTACTC 


CTTCTTTAGA 


AACTTCCAGC 


22740 


CCATGAGCCG 


TCAGGTGGTG 


GATGATACTA 


AATACAAGGA 


CTACCAACAG 


GTGGGCATCC 


22800 


TACACCAACA 


CAACAACTCT 


GGATTTGTTG 


GCTACCTTGC 


CCCCACCATG 


CGCGAAGGAC 


22860 


AGGCCTACCC 


TGCTAACTTC 


CCCTATCCGC 


TTATAGGCAA 


GACCGCAGTT 


GACAGCATTA 


22920 


CCCAGAAAAA 


GTTTCTTTGC 


GATCGCACCC 


TTTGGCGCAT 


CCCATTCTCC 


AGTAACTTTA 


22980 


TGTCCATGGG 


CGCACTCACA 


GACCTGGGCC 


AAAACCTTCT 


CTACGCCAAC 


TCCGCCCACG 


23040 


CGCTAGACAT 


GACTTTTGAG 


GTGGATCCCA 


TGGACGAGCC 


CACCCTTCTT 


TATGTTTTGT 


23100 


TTGAAGTCTT 


TGACGTGGTC 


CGTGTGCACC 


GGCCGCACCG 


CGGCGTCATC 


GAAACCGTGT 


23160 


ACCTGCGCAC 


GCCCTTCTCG 


GCCGGCAACG 


CCACAACATA 


AAGAAGCAAG 


CAACATCAAC 


23220 


AACAGCTGCC 


GCCATGGGCT 


CC AGTGAGCA 


GGAACTGAAA 


GCCATTGTCA 


AAGATCTTGG 


23280 


TTGTGGGCCA 


TATTTTTTGG 


GCACCTATGA 


CAAGCGCTTT 


CCAGGCTTTG 


TTTCTCCACA 


23340 


CAAGCTCGCC 


TGCGCCATAG 


TCAATACGGC 


CGGTCGCGAG 


ACTGGGGGCG 


TACACTGGAT 


23400 


GGCCTTTGCC 


TGGAACCCGC 


ACTCAAAAAC 


ATGCTACCTC 


TTTGAGCCCT 


TTGGCTTTTC 


23460 


TGACCAGCGA 


CTCAAGCAGG 


TTTACCAGTT 


TGAGTACGAG 


TCACTCCTGC 


GCCGTAGCGC 


23520 


CATTGCTTCT 


TCCCCCGACC 


GCTGTATAAC 


GCTGGAAAAG 


TCCACCCAAA 


GCGTACAGGG 


23580 


GCCCAACTCG 


GCCGCCTGTG 


GACTATTCTG 


CTGCATGTTT 


CTCCACGCCT 


TTGCCAACTG 


23640 


GCCCCAAACT 


CCCATGGATC 


ACAACCCCAC 


CATGAACCTT 


ATTACCGGGG 


TACCCAACTC 


23700 


CATGCTCAAC 


AGTCCCCAGG 


TACAGCCCAC 


CCTGCGTCGC 


AACCAGGAAC 


AGCTCTACAG 


23760 


CTTCCTGGAG 


CGCCACTCGC 


CCTACTTCCG 


CAGCCACAGT 


GCGCAGATTA 


GGAGCGCCAC 


23820 


TTCTTTTTGT 


CACTTGAAAA 


ACATGTAAAA 


ATAATGTACT 


AGAGACACTT 


TCAATAAAGG 


23880 


CAAATGCTTT 


TATTTGTACA 


CTCTCGGGTG 


ATTATTTACC 


CCCACCCTTG 


CCGTCTGCGC 


23940 


CGTTTAAAAA 


TCAAAGGGGT 


TCTGCCGCGC 


ATCGCTATGC 


GCCACTGGCA 


GGGACACGTT 


24000 


GCGATACTGG 


TGTTTAGTGC 


TCCACTTAAA 


CTCAGGCACA 


ACCATCCGCG 


GCAGCTCGGT 


24060 


GAAGTTTTCA 


CTCCACAGGC 


TGCGCACCAT 


CACCAACGCG 


TTTAGCAGGT 


CGGGCGCCGA 


24120 


TATCTTGAAG 


TCGCAGTTGG 


GGCCTCCGCC 


CTGCGCGCGC 


GAGTTGCGAT 


ACACAGGGTT 


24180 
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GCAGCACTGG AACACTATCA GCGCCGGGTG GTGCACGCTG GCCAGCACGC TCTTGTCGGA 24 24 0 

GATCAGATCC GCGTCCAGGT CCTCCGCGTT GCTCAGGGCG AACGGAGTCA ACTTTGGTAG 24 300 

CTGCCTTCCC AAAAAGGGCG CGTGCCCAGG CTTTGAGTTG CACTCGCACC GTAGTGGCAT 24 36 0 

CAAAAGGTGA CCGTGCCCGG TCTGGGCGTT AGGATACAGC GCCTGCATAA AAGCCTTGAT 2442 0 

CTGCTTAAAA GCCACCTGAG CCTTTGCGCC TTCAGAGAAG AACATGCCGC AAGACTTGCC 24480 

GGAAAACTGA TTGGCCGGAC AGGCCGCGTC GTGCACGCAG CACCTTGCGT CGGTGTTGGA 24 540 

GATCTGCACC ACATTTCGGC CCCACCGGTT CTTCACGATC TTGGCCTTGC TAGACTGCTC 24600 

CTTCAGCGCG CGCTGCCCGT TTTCGCTCGT CACATCCATT TCAATCACGT GCTCCTTATT 24660 

TATCATAATG CTTCCGTGTA GACACTTAAG CTCGCCTTCG ATCTCAGCGC AGCGGTGCAG 24 720 

CCACAACGCG CAGCCCGTGG GCTCGTGATG CTTGTAGGTC ACCTCTGCAA ACGACTGCAG 24 78 0 

GTACGCCTGC AGGAATCGCC CCATCATCGT CACAAAGGTC TTGTTGCTGG TGAAGGTCAG 24 840 

CTGCAACCCG CGGTGCTCCT CGTTCAGCCA GGTCTTGCAT ACGGCCGCCA GAGCTTCCAC 24900 

TTGGTCAGGC AGTAGTTTGA AGTTCGCCTT TAGATCGTTA TCCACGTGGT ACTTGTCCAT 24960 

CAGCGCGCGC GCAGCCTCCA TGCCCTTCTC CCACGCAGAC ACGATCGGCA CACTCAGCGG 2 5020 

GTTCATCACC GTAATTTCAC TTTCCGCTTC GCTGGGCTCT TCCTCTTCCT CTTGCGTCCG 25080 

CATACCACGC GCCACTGGGT CGTCTTCATT CAGCCGCCGC ACTGTGCGCT TACCTCCTTT 2514 0 

GCCATGCTTG ATTAGCACCG GTGGGTTGCT GAAACCCACC ATTTGTAGCG CCACATCTTC 2 5200 

TCTTTCTTCC TCGCTGTCCA CGATTACCTC TGGTGATGGC GGGCGCTCGG GCTTGGGAGA 25260 

AGGGCGCTTC TTTTTCTTCT TGGGCGCAAT GGCCAAATCC GCCGCCGAGG TCGATGGCCG 25320 

CGGGCTGGGT GTGCGCGGCA CCAGCGCGTC TTGTGATGAG TCTTCCTCGT CCTCGGACTC 25380 

GATACGCCGC CTCATCCGCT TTTTTGGGGG CGCCCGGGGA GGCGGCGGCG ACGGGGACGG 25440 

GGACGACACG TCCTCCATGG TTGGGGGACG TCGCGCCGCA CCGCGTCCGC GCTCGGGGGT 2 5500 

GGTTTCGCGC TGCTCCTCTT CCCGACTGGC CATTTCCTTC TCCTATAGGC AGAAAAAGAT 2 5560 

CATGGAGTCA GTCGAGAAGA AGGACAGCCT AACCGCCCCC TCTGAGTTCG CCACCACCGC 25620 

CTCCACCGAT GCCGCCAACG CGCCTACCAC CTTCCCCGTC GAGGCACCCC CGCTTGAGGA 2 5680 

GGAGGAAGTG ATTATCGAGC AGGACCCAGG TTTTGTAAGC GAAGACGACG AGGACCGCTC 2 5740 

AGTACCAACA GAGGATAAAA AGCAAGACCA GGACAACGCA GAGGCAAACG AGGAACAAGT 25800 

CGGGCGGGGG GACGAAAGGC ATGGCGACTA CCTAGATGTG GGAGACGACG TGCTGTTGAA 25860 

GCATCTGCAG CGCCAGTGCG CCATTATCTG CGACGCGTTG CAAGAGCGCA GCGATGTGCC ' 25-920 

CCTCGCCATA GCGGATGTCA GCCTTGCCTA CGAACGCCAC CTATTCTCAC CGCGCGTACC 25980 

CCCCAAACGC CAAGAAAACG GCACATGCGA GCCCAACCCG CGCCTCAACT TCTACCCCGT 26040 

ATTTGCCGTG CCAGAGGTGC TTGCCACCTA TCACATCTTT TTCCAAAACT GCAAGATACC 26100 

CCTATCCTGC CGTGCCAACC GCAGCCGAGC GGACAAGCAG CTGGCCTTGC GGCAGGGCGC 26160 

TGTCATACCT GATATCGCCT CGCTCAACGA AGTGCCAAAA ATCTTTGAGG GTCTTGGACG 26220 

CGACGAGAAG CGCGCGGCAA ACGCTCTGCA ACAGGAAAAC AGCGAAAATG AAAGTCACTC 26280 
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TGGAGTGTTG 


GTGGAACTCG 


AGGGTGACAA 


CGCGCGCCTA 


PCT/US97/19541 

GCCGTACTAA AACGCAGCAT 26 34 0 


CGAGGTCACC 


CACTTTGCCT 


ACCCGGCACT 


TAACCTACCC 


CCCAAGGTCA 


TGAGCACAGT 


26400 


CATGAGTGAG 


CTGATCGTGC 


GCCGTGCGCA 


GCCCCTGGAG 


AGGGATGCAA 


ATTTGCAAGA 


26460 


ACAAACAGAG 


GAGGGCCTAC 


CCGCAGTTGG 


CGACGAGCAG 


CTAGCGCGCT 


GGCTTCAAAC 


26520 


GCGCGAGCCT 


GCCGACTTGG 


AGGAGCGACG 


CAAACTAATG 


ATGGCCGCAG 


TGCTCGTTAC 


26580 


CGTGGAGCTT 


GAGTGCATGC 


AGCGGTTCTT 


TGCTGACCCG 


GAGATGCAGC 


GCAAGCTAGA 


26640 


GGAAACATTG 


CACTACACCT 


TTCGACAGGG 


CTACGTACGC 


CAGGCCTGCA 


AGATCTCCAA 


26700 


CGTGGAGCTC 


TGCAACCTGG 


TCTCCTACCT 


TGGAATTTTG 


CACGAAAACC 


GCCTTGGGCA 


26760 


AAACGTGCTT 


CATTCCACGC 


TCAAGGGCGA 


GGCGCGCCGC 


GACTACGTCC 


GCGACTGCGT" 


26820 


TTACTTATTT 


CTATGCTACA 


CCTGGCAGAC 


GGCCATGGGC 


GTTTGGCAGC 


AGTGCTTGGA 


26880 


GGAGTGCAAC 


CTCAAGGAGC 


TGCAGAAACT 


GCTAAAGCAA 


AACTTGAAGG 


ACCTATGGAC 


26940 


GGCCTTCAAC 


GAGCGCTCCG 


TGGCCGCGCA 


CCTGGCGGAC 


ATCATTTTCC 


CCGAACGCCT 


27000 


GCTTAAAACC 


CTGCAACAGG 


GTCTGCCAGA 


CTTCACCAGT 


CAAAGCATGT 


TGCAGAACTT 


27060 


TAGGAACTTT 


ATCCTAGAGC 


GCTCAGGAAT 


CTTGCCCGCC 


ACCTGCTGTG 


CACTTCCTAG 


27120 


CGACTTTGTG 


CCCATTAAGT 


ACCGCGAATG 


CCCTCCGCCG 


CTTTGGGGCC 


ACTGCTACCT 


27180 


TCTGCAGCTA 


GCCAACTACC 


TTGCCTACCA 


CTCTGACATA 


ATGGAAGACG 


TGAGCGGTGA 


27240 


CGGTCTACTG 


GAGTGTCACT 


GTCGCTGCAA 


CCTATGCACC 


CCGCACCGCT 


CCCTGGTTTG 


27300 


CAATTCGCAG 


CTGCTTAACG 


AAAGTCAAAT 


TATCGGTACC 


TTTGAGCTGC 


AGGGTCCCTC 


27360 


GCCTGACGAA 


AAGTCCGCGG 


CTCCGGGGTT 


GAAACTCACT 


CCGGGGCTGT 


GGACGTCGGC 


27420 


TTACCTTCGC 


AAATTTGTAC 


CTGAGGACTA 


CCACGCCCAC 


GAGATTAGGT 


TCTACGAAGA 


27480 


CCAATCCCGC 


CCGCCAAATG 


CGGAGCTTAC 


CGCCTGCGTC 


ATTACCCAGG 


GCCACATTCT 


27540 


TGGCCAATTG 


CAAGCCATCA 


ACAAAGCCCG 


CCAAGAGTTT 


CTGCTACGAA 


AGGGACGGGG 


27600 


GGTTTACTTG 


GACCCCCAGT 


CCGGCGAGGA 


GCTCAACCCA 


ATCCCCCCGC 


CGCCGCAGCC 


27660 


CTATCAGCAG 


CAGCCGCGGG* CCCTTGCTTC 


CCAGGATGGC 


ACCCAAAAAG 


AAGCTGCAGC 


27720 


TGCCGCCGCC 


ACCCACGGAC 


GAGGAGGAAT 


ACTGGGACAG 


TCAGGCAGAG 


GAGGTTTTGG 


27780 


ACGAGGAGGA 


GGAGGACATG 


ATGGAAGACT 


GGGAGAGCCT 


AGACGAGGAA 


GCTTCCGAGG 


27840 


TCGAAGAGGT 


GTCAGACGAA 


ACACCGTCAC 


CCTCGGTCGC 


ATTCCCCTCG 


CCGGCGCCCC 


27900 


AGAAATCGGC 


AACCGGTTCC 


AGCATGGCTA 


CAACCTCCGC 


TCCTCAGGCG 


CCGCCGGCAC 


27960 


TGCCCGTTCG 


CCGACCCAAC 


CGTAGATGGG 


ACACCACTGG 


AACCAGGGCC 


GGTAAGTCCA 


28020 


AGCAGCCGCC 


GCCGTTAGCC 


CAAGAGCAAC 


AACAGCGCCA 


AGGCTACCGC 


TCATGGCGCG 


28080 


GGCACAAGAA 


CGCCATAGTT 


GCTTGCTTGC 


AAGACTGTGG 


GGGCAACATC 


TCCTTCGCCC 


28140 


GCCGCTTTCT 


TCTCTACCAT 


CACGGCGTGG 


CCTTCCCCCG 


TAACATCCTG 


CATTACTACC 


28200 


GTCATCTCTA 


CAGCCCATAC 


TGCACCGGCG 


GCAGCGGCAG 


CGGCAGCAAC 


AGCAGCGGCC 


28260 


ACACAGAAGC 


AAAGGCGACC 


GGATAGCAAG 


ACTCTGACAA 


AGCCCAAGAA 


ATCCACAGCG 


28320 


GCGGCAGCAG 


CAGGAGGAGG 


AGCGCTGCGT 


CTGGCGCCCA 


ACGAACCCGT 


ATCGACCCGC 


28380 
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GAGCT7AGAA 


ACAGGATTTT 


TCCCACTCTG 


TATGCTATAT 


TTCAArzinar 


CAGGGGCCAA 


2844 0 


GAACAAGAGC 


TGAAAATAAA 


AAACAGGTCT 


CTGCGATCCC 


TCAcernrAn 


CTGCCTGTAT 


28500 


CACAAAAGCG 


AAGATCAGCT 


TCGGCGCACG 


CTGGAAGACG 


v-uunV3UL 1 V. J. 


CTTCAGTAAA 


28560 


TACTGCGCGC 


TGACTCTTAA 


GGACTAGTTT 


CGCGCCCTTT 


CTCAAATTTA 


AGCGCGAAAA 


28620 


CTACGTCATC 


TCCAGCGGCC 


ACACCCGGCG 


CCAGCACCTG 


TCGTCAC1CC1C 


CATTATGAGC 


28680 


AAGGAAATTC 


CCACGCCCTA 


CATGTGGAGT 


TACCAGCCAC 


AA A TP. (7 P. a PT 


TGCGGCTGGA 


2874 0 


GCTGCCCAAG 


ACTACTCAAC 


CCGAATAAAC 


TACATGAGCG 


pp.ftp,APPPP7A 


CATGATATCC 


2 8800 


CGGGTCAACG 


GAATCCGCGC 


CCACCGAAAC 


CGAATTCTCT 


T cz p. a 7i p ta p p r 


GGCTATTACC 




ACCACACCTC 


GTAATAACCT 


TAATCCCCGT AGTTGGCCCG 


L lbLLCTGGT 


GTACCAGGAA 




AGTCCCGCTC 


CCACCACTGT 


GGTACTTCCC 


AGAGACGCCC 


AGbL CGAAGT 


TCAGATGACT 




AACTCAGGGG 


CGCAGCTTGC 


GGGCGGCTTT 


CGTCACAGGG 


IbCGGTCGCC 


CGGGCAGGGT 




ATAACTCACC 


TGACAATCAG 


AGGGCGAGGT 


ATTCAGCTCA 


ACGACGAGTC 


GGTGAGCTCC 


*)qi fin 


TCGCTTGGTC 


TCCGTCCGGA 


CGGGACATTT 


CAGATCGGPfi 


GLGCCGGCCG 


TCCTTCATTP 




ACGCCTCGTC 


AGGCAATCCT 


AACTCTGCAG 


ACCTCGTCCT 


L J. GAGCCGCG 


CTCTGGAGGC 




ATTGGAACTC 


TGCAATTTAT 


TGAGGAGTTT 


G TG C C A t can 


I LTACTTTAA 


CCCCTTCTCG 




GGACCTCCCG 


GCCACTATCC 


GGATCAATTT 


ATTCCTAACT 


1 I GALGCGGT 


AAAGGACTCG 




GCGGACGGCT 


ACGACTGAAT 


GTTAATTAAG 


TTCCTGTCCA 


I LCGCACCCA 


CTATCTTCAT 




GTTGTTGCAG 


ATGAAGCGCG 


CAAGACCGTC 


TGAAGATACC 


1 1 LAACCCCG 


TGTATCCATA 




TGACACGGAA 


ACCGGTCCTC 


CAACTGTGCC 


TTTTCTTACT 


LLTCCCTTTG 


TATCCCCCAA 




TGGGTTTCAA 


GAGAGTCCCC 


CTGGGGTACT 


CTCTTTGCGC 


<- 1 A I LCGAAC 


CTCTAGTTAC 




CTCCAATGGC 


ATGCTTGCGC 


TCAAAATGGG 


CAACGGCCTC 


1L1 LTGGACG 


AGGCCGGCAA 




CCTTACCTCC 


CAAAATGTAA 


CCACTGTGAG 


CCCACCTCTC 


A A A T\ TV TV A /"■• Tv 

AAAAAAACCA 


AGTCAAACAT 




AAACCTGGAA 


ATATCTGCAC 


CCCTCACAGT 


TACCTCAGAA 


vjLLCTAACTG 


TGGCTGCCGC 




CGCACCTCTA 


ATGGTCGCGG 


GCAACACACT 


CACCATGCAA 


i LACAGGCCC 


CGCTAACPGT 




GCACGACTCC 


AAACTTAGCA 


TTGCCACCCA 


AGGACCCCTC 


ACAGTGTCAG 


AAGGAAAGCT 




AGCCCTGCAA 


ACATCAGGCC 


CCCTCACCAC 


CACCGATAGC 


Tv f"P Tv PPnTifTi* 

AGTACCCTTA 


CTATCACTGC 


2 994 0 


CTCACCCCCT 


CTAACTACTG 


CCACTGGTAG 


CTTGGGCATT 


GACTTGAAAG 


AGCCCATTTA 


30000 


TACACAAAAT 


GGAAAACTAG 


GACTAAAGTA 


CGGGGCTCCT 


TTPPTiTPTTi T. 

1 lot-AiGTAA 


CAGACGACCT 


30060 


AAACACTTTG 


ACCGTAGCAA 


CTGGTCCAGG 


TGTGACTATT 


7A 7A TTa ATA /"*t»»"p 

AA I AA I AL.TT 


CCTTGCAAAC 


30120 


TAAAGTTACT 


GGAGCCTTGG 


GTTTTGATTC 


ACAAGGCAAT 


Ai VjUAAL I X A 


ATGTAGCAGG 


30180 


AGGACTAAGG 


ATTGATTCTC 


AAAACAGACG 


CCTTATACTT 


P. 7i T'PTT A /*"T"T 
IjA 1 L) i 1 AG I T 


ATCCGTTTGA 


30240 


TGCTCAAAAC 


CAACTAAATC 


TAAGACTAGG 


ACAGGGCCCT 


v- 1 I I I I AI AA 


ACTCAGCrrA 




CAACTTGGAT 


ATTAACTACA 


ACAAAGGCCT 


TTACTTGTTT 


ACAGCTTCAA 


ACAATTCCAA 


30360 


AAAGCTTGAG 


GTTAACCTAA 


GCACTGCCAA 


GGGGTTGATG 


TTTGACGCTA 


CAGCCATAGC 


30420 


CATTAATGCA 


GGAGATGGGC 


TTGAATTTGG 


TTCACCTAAT 


GCACCAAACA 


CAAATCCCCT 


30480 
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CAAAACAAAA 


ATTGGCCATG 


GCCTAGAATT 


TGATTCAAAC 


AAGGCTATGG 


TTCCTAAACT 


30540 


AGGAACTGGC 


CTTAGTTTTG 


ACAGCACAGG 


TGCCATTACA 


GTAGGAAACA 


AAAATAATGA 


30600 


TAAGCTAACT 


TTGTGGACCA 


CACCAGCTCC 


ATCTCCTAAC 


TGTAGACTAA 


ATGCAGAGAA 


30660 


AGATGCTAAA 


CTCACTTTGG 


TCTTAACAAA 


ATGTGGCAGT 


CAAATACTTG 


CTACAGTTTC 


30720 


AGTTTTGGCT 


GTTAAAGGCA 


GTTTGGCTCC 


AATATCTGGA 


ACAGTTCAAA 


GTGCTCATCT 


30780 


TATTATAAGA 


TTTGACGAAA 


ATGGAGTGCT 


ACTAAACAAT 


TCCTTCCTGG 


ACCCAGAATA 


30840 


TTGGAACTTT 


AGAAATGGAG 


ATCTTACTGA 


AGGCACAGCC 


TATACAAACG 


CTGTTGGATT 


30900 


TATGCCTAAC 


CTATCAGCTT 


ATCCAAAATC 


TCACGGTAAA 


ACTGCCAAAA 


GTAACATTGT 


30960 


CAGTCAAGTT 


TACTTAAACG 


GAGACAAAAC 


TAAACCTGTA 


ACACTAACCA 


TTACACTAAA 


31020 


CGGTACACAG 


GAAACAGGAG 


ACACAACTCC 


AAGTGCATAC 


TCTATGTCAT 


TTTCATGGGA 


31080 


CTGGTCTGGC 


CACAACTACA 


TTAATGAAAT 


ATTTGCCACA 


TCCTCTTACA 


CTTTTTCATA 


31140 


CATTGCCCAA 


GAATAAAGAA 


TCGTTTGTGT 


TATGTTTCAA 


CGTGTTTATT 


TTTCAATTGC 


31200 


AGAAAATTTC 


AAGTCATTTT 


TCATTCAGTA 


GTATAGCCCC 


ACCACCACAT 


AGCTTATACA 


31260 


GATCACCGTA 


CCTTAATCAA 


ACTCACAGAA 


CCCTAGTATT 


CAACCTGCCA 


CCTCCCTCCC 


31320 


AACACACAGA 


GTACACAGTC 


CTTTCTCCCC 


GGCTGGCCTT 


AAAAAGCATC 


ATATCATGGG 


31380 


TAACAGACAT 


ATTCTTAGGT 


GTTATATTCC 


ACACGGTTTC 


CTGTCGAGCC 


AAACGCTCAT 


31440 


CAGTGATATT 


AATAAACTCC 


CCGGGCAGCT 


CACTTAAGTT 


CATGTCGCTG 


TCCAGCTGCT 


31500 


GAGCCACAGG 


CTGCTGTCCA 


ACTTGCGGTT 


GCTTAACGGG 


CGGCGAAGGA 


GAAGTCCACG 


31560 


CCTACATGGG 


GGTAGAGTCA 


TAATCGTGCA 


TCAGGATAGG 


GCGGTGGTGC 


TGCAGCAGCG 


31620 


CGCGAATAAA 


CTGCTGCCGC 


CGCCGCTCCG 


TCCTGCAGGA 


ATACAACATG 


GCAGTGGTCT 


31680 


CCTCAGCGAT 


GATTCGCACC 


GCCCGCAGCA 


TAAGGCGCCT 


TGTCCTCCGG 


GCACAGCAGC 


31740 


GCACCCTGAT 


CTCACTTAAA 


TCAGCACAGT 


AACTGCAGCA 


CAGCACCACA 


ATATTGTTCA 


31800 


AAATCCCACA 


GTGCAAGGCG 


CTGTATCCAA 


AGCTCATGGC 


GGGGACCACA 


GAACCCACGT 


31860 


GGCCATCATA 


CCACAAGCGC 


AGGTAGATTA 


AGTGGCGACC 


CCTCATAAAC 


ACGCTGGACA 


31920 


TAAACATTAC 


CTCTTTTGGC 


ATGTTGTAAT 


TCACCACCTC 


CCGGTACCAT 


ATAAACCTCT 


31980 


GATTAAACAT 


GGCGCCATCC 


ACCACCATCC 


TAAACCAGCT 


GGCCAAAACC 


TGCCCGCCGG 


32040 


CTATACACTG 


CAGGGAACCG 


GGACTGGAAC 


AATGACAGTG 


GAGAGCCCAG 


GACTCGTAAC 


32100 


CATGGATCAT 


CATGCTCGTC 


ATGATATCAA 


TGTTGGCACA 


ACACAGGCAC 


ACGTGCATAC 


32160 


ACTTCCTCAG 


GATTACAAGC 


TCCTCCCGCG 


TTAGAACCAT 


ATCCCAGGGA 


ACAACCCATT 


32220 


CCTGAATCAG 


CGTAAATCCC 


ACACTGCAGG 


GAAGACCTCG 


CACGTAACTC 


ACGTTGTGCA 


32280 


TTGTCAAAGT 


GTTACATTCG 


GGCAGCAGCG 


GATGATCCTC 


CAGTATGGTA 


GCGCGGGTTT 


32340 


CTGTCTCAAA 


AGGAGGTAGA 


CGATCCCTAC 


TGTACGGAGT 


GCGCCGAGAC 


AACCGAGATC 


32400 


GTGTTGGTCG 


TAGTGTCATG 


CCAAATGGAA 


CGCCGGACGT 


AGTCATATTT 


CCTGAAGCAA 


32460 


AACCAGGTGC 


GGGCGTGACA 


AACAGATCTG 


CGTCTCCGGT 


CTCGCCGCTT 


AGATCGCTCT 


32520 


GTGTAGTAGT 


TGTAGTATAT 


CCACTCTCTC 


AAAGCATCCA 


GGCGCCCCCT 


GGCTTCGGGT 


32580 
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TCTATGTAAA 


CTCCTTCATG 


CGCCGCTGCC 


CTGATAACAT 


CCACCACCGC AGAATAAGCC 


3264 C 


ACACCCAGCC 


AACCTACACA 


TTCGTTCTGC 


GAGTCACACA 


CGGGAGGAGC GGGAAGAGCT 


32700 


GGAAGAACCA 


TGTTTTTTTT 


TTTATTCCAA 


AAGATTATCC 


AAAACCTCAA AATGAAGATC 


32760 


TATTAAGTGA 


ACGCGCTCCC 


CTCCGGTGGC 


GTGGTCAAAC 


TCTACAGCCA AAGAACAGAT 


32820 


AATGGCATTT 


GTAAGATGTT 


GCACAATGGC 


TTCCAAAAGG 


CAAACGGCCC TCACGTCCAA 


32880 


GTGGACGTAA 


AGGCTAAACC 


CTTCAGGGTG 


AATCTCCTCT 


ATAAACATTC CAGCACCTTC 


32940 


AACCATGCCC 


AAATAATTCT 


CATCTCGCCA 


CCTTCTCAAT 


ATATCTCTAA GCAAATCCCG 


33000 


AATATTAAGT 


CCGGCCATTG 


TAAAAATCTG 


CTCCAGAGCG 


CCCTCCACCT TCAGCCTCAA 


3306 0 


GCAGCGAATC 


ATGATTGCAA 


AAATTCAGGT 


TCCTCACAGA 


CCTGTATAAG ATTCAAAAGC 


J J ± z u 


GGAACATTAA 


CAAAAATACC 


GCGATCCCGT 


AGGTCCCTTC 


GCAGGGCCAG CTGAAPATAA 


J J 1 O 0 


TCGTGCAGGT 


CTGCACGGAC 


CAGCGCGGCC 


APTTPppppp 


CAGGAACCTT GACAAAziP.nL 


33240 


CCCACACTGA 


TTATGACACG 


CATACTCGGA 


G TTATf; PT A A 


CCAGCGTAGC CCCGATGTAA 


■J "i *3 ft n 
6 J J 00 


GCTTTGTTGC 


ATGGGCGGCG 


ATATAAAATG 


CAAGGTGPTP, 


CTCAAAAAAT CAGGCAAAGC 


360 


CTCGCGCAAA 


AAAGAAAGCA 


CATCGTAGTC 


ATG CTCATC5P 


AGATAAAGGC AGGTAAQCTC 




CGGAACCACC 


ACAGAAAAAG 


ACACCATTTT 


TCTCTPAAAP 


ATGTCTGCGG GTTTCTGCAT 


J .3 *i 0 U 


AAACACAAAA 


TAAAATAACA 


AAAAAACATT 


TAAACATTAG 


AAGCCTGTCT TACAACAGGA 




AAAACAACCC 


TTATAAGCAT 


AAGACGGACT 


ACGGCPATftP 


CGGCGTGACC GTAAAAAAac 


J JbUU 


TGGTCACCGT 


GATTAAAAAG 


CACCACCGAC 


AGCTCCTPGfi 


TCATGTCCGG AGTCATAATp 


TICCA 

J -5 b o U 


TAAGACTCGG 


TAAACACATC 


AGGTTGATTC 


ATPGHTPAP.T 


GCTAAAAAGC GACCGAAATA 


n 7 *5 n 


GCCCGGGGGA 


ATACATACCC 


GCAGGCGTAG 


AGACAACATT 


ACAGCCCCCA TAGGAGGTAT 


J J / HU 


AACAAAATTA 


ATAGGAGAGA 


AAAACACATA 


AACAPPTRAA 


AAACCCTCCT GCCTAGGCAA 




AATAGCACCC 


TCCCGCTCCA 


GAACAACATA 


cagp(5pttpa 


CAGCGGCAGC CTAACAGTCA 




GCCTTACCAG 


TAAAAAAGAA 


AACCTATTAA 


AAAAAPAPPA 


CTCGACACGG CACCAGCTCA 




ATCAGTCACA 


GTGTAAAAAA 


GGGCCAAGTG 


CAGAGrP,Ar;T 


ATATATAGGA CTAAAAAATG 




ACGTAACGGT 


TAAAGTCCAC 


AAAAAACACC 


PAftAA aappp 


CACGCGAACC TACGCCCAGA 


J ** w o \j 


AACGAAAGCC 


AAAAAACCCA 


CAACTTCCTC 


AAATCGTCAC 


TTCCGTTTTC CCACGTTAPr; 


14 U 


TAACTTCCCA 


TTTTAAGAAA 


ACTACAATTC 




CAAGTTACTC CGCCCTAAAA 


34200 


CCTACGTCAC 


CCGCCCCGTT 


CCCACGCCCC 


GCGCCACGTC 


ACAAACTCCA CCCCCTCATT 


34260 


ATCATATTGG 


CTTCAATCCA 


AAATAAGGTA 


TATTATTGAT 


GAT 


34303 


(2) INFORMATION FOR SEQ ID NO : 5 : 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:S: 
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*-CAL»uL i i * A 


CACTTTATGC 


TTCCGGCTCG 


TATGTTGTGT 


GGAATTGTGA 


GCGGATAACA 


60 


Mill LALi-iCA 


GGAAACAGCT 


ATGACCATGA 


TTACGCCAAG 


CGCGCAATTA 


ACCCTCACTA 


120 


HPi\j \j Lj aa c aa 


AAGCTGGGTA 


CCGGGCCCCC 


CCTCGAGGTC 


GACGGTATCG 


ATAAGCTTAC 


180 


VjLtjTbGu CTA 


GGCGGCCGAA 


TTCCTGCAGC 


CCGGGGGATC 


CACTAGTTCT 


AGAGCGGCCG 


240 


CCACCGCGGC 


GCCTTAATTA 


ATACGTAAGC 


TCCAATTCGC 


CCTATAGTGA 


GTCGTATTAC 


300 


GCGCGCTCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


CGTTACCCAA 


360 


CTTAATCGCC 


TTGCAGCACA 










380 


(2) INFORMATION FOR SEQ ID NO : 6 : 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(xii SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

TGCCGCAGCA CCGGATGCAT C 

(2) INFORMATION FOR SEQ ID NO : 7 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GCGTCCGGAG GCTGCCATGC GGCAGGG 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14B1 base oairs 

(B) TYPE: nucleic acid* 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



GGCTGCCATG 


CGGCAGGGAT 


ACGGCGCTAA 


CGATGCATCT 


CAACAATTGT 


TGTGTAGGTA 


60 


CTCCGCCGCC 


GAGGGACCTG 


AGCGAGTCCG 


CATCGACCGG 


ATCGGAAAAC 


CTCTCGAGAA 


120 


AGGCGTCTAA 


CCAGTCACAG 


TCGCAAGGTA 


GGCTGAGCAC 


CGTGGCGGGC 


GGCAGCGGGC 


180 


GGCGGTCGGG 


GTTGTTTCTG 


GCGGAGGTGC 


TGCTGATGAT 


GTAATTAAAG 


TAGGCGGTCT 


240 


TGAGACGGCG 


GATGGTCGAC 


AGAAGCACCA 


TGTCCTTGGG 


TCCGGCCTGC 


TGAATGCGCA 


300 


GGCGGTCGGC 


CATGCCCCAG 


GCTTCGTTTT 


GACATCGGCG 


CAGGTCTTTG 


TAGTAGTCTT 


360 


GCATGAGCCT 


TTCTACCGGC 


ACTTCTTCTT 


CTCCTTCCTC 


TTGTCCTGCA 


TCTCTTGCAT 


420 
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CTATCGCTGC 


GGCGGCGGCG 


GAGTTTGGCC 


GTAGGTGGCG 


CCCTrTTPCT 


CCCATGCGTG 


4 8 0 


TGACCCCGAA 


GCCCCTCATC 


GGCTGAAGCA 


GGGCTAGGTC 


GGCGACAAPG 


CGCTCGGCTA 


54 0 


ATATGGCCTG 


CTGCACCTGC 


GTGAGGGTAG 


ACTGGAAGTC 


ATCCATGTPP 


ACAAAGCGGT 


600 


GGTATGCGCC 


CGTGTTGATG 


GTGTAAGTGC 


AGTTGGCCAT 


AACGGACCAG 


TTAACGGTCT 


660 


GGTGACCCGG 


CTGCGAGAGC 


TCGGTGTACC 


TGAGACGCGA 


GTAAGPPPTP 

vj i. rxn\3 LUC 1L 


GAGTCAAATA 




CGTAGTCGTT 


GCAAGTCCGC 


ACCAGGTACT 


GGTATCCCAC 


CAAAAAGTP,P 


GGCGGCGGCT 


Ton 


GGCGGTAGAG 


GGGCCAGCGT 


AGGGTGGCCG 


GGGCTCCGGG 


gcicgixcz atpt 


TCCAACATAA 


o4 0 


GGCGATGATA 


TCCGTAGATG 


TACCTGGACA 


TCCAGGTGAT 


f? p p ft r2 p fs<^ r*^ 


vj a vjvj i ooMub 


900 


CGCGCGGAAA 


GTCGCGGACG 


CGGTTCCAGA 




LGGLAAAAAG 


TGPTPPATPH 


960 


TCGGGACGCT 


CTGGCCGGTC 


AGGCGCGCGC 


AATCGTTGAP 


oLIL I I 


GCA&A^pr.ar 

VJ V- r\ArtM W O rtO 


1020 


AGCCTGTAAG 


CGGGCACTCT 


TCCGTGGTCT 


GGTGGATAAA 


i H-GLAAGGG 


* "* * X VJVJ CO 


luo 0 


GACGACCGGG 


GTTCGAGCCC 


CGTATCCGGC 


CGTCCGCCGT 






J. J. 4 U 


GCGTGTCGAA 


CCCAGGTGTG 


CGACGTCAGA 


CAACGGGGGA 


CZ TC2 P T fPTT T 
va I vjl_ ILL! I I 


TGGPTTPPTT 




CCAGGCGCGG 


CGGCTGCTGC 


GCTAGCTTTT 


TTGGCCAPTH 




vjv_\j l hMuLbu 


1260 


TTAGGCTGGA 


AAGCGAAAGC 


ATTAAGTGGC 


I LuL 1 LLl I vj 


TAGCCGGAGG 


vj X i AI TTTCC 


1320 


AAGGGTTGAG 


TCGCGGGACC 


CCCGGTTCdJx 


ulCJ LvjvjALL 


GGCCGGACTG 


CGGCGAACGG 


1380 


GGGTTTGCCT 


CCCCGTCATG 


CAAGACCCCG 


CTTGCAAATT 


CCTCCGGAAA 


CAGGGACGAG 


1440 


CCCCTTTTTT 


GCTTTTCCCA 


GATGCATCCG 


GTGCTGCGGC 


A 




1481 


(2) INFORMATION FOR SEQ ID NO : 9 : 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

GAATTCCCCA TCCTGGTCTA TAGAGAGAGT TCCAGAACAG CCAGGGCTAC AGATAAACCC 

ATCTGGAAAA ACAAAGTTGA ATGACCCAAG AGGGGTTCTC AGAGGGTGGC GTGTGCTCCC 

TGGCAAGCCT ATGACATGGC CGGGGCCTGC CTCTCTCTGC CTCTGACCCT CAGTGGCTCC 

CATGAACTCC TTGCCCAATG GCATCTTTTT CCTGCGCTCC TTGGGTTATT CCAGTCTCCC 

CTCAGCATTC CTTCCTCAGG GCCTCGCTCT TCTCTCTGCT CCCTCCTTGC ACAGCTGGCT ' 

CTGTCCACCT CAGATGTCAC AGTGCTCTCT CAGAGGAGGA AGGCACCATG TACCCTCTGT 360 

TTCCCAGGTA AGGGTTCAAT TTTTAAAAAT GGTTTTTTGT TTGTTTGTTT GTTTGTTTGT 42 0 

TTGTTTGTTT TTCAAGACAG GGCTCCTCTG TGTAGTCCTA ACTGTCTTGA AACTCCCTCT 4 80 

GTAGACCAGG TCGACCTCGA ACTCTTGAAA CCTGCCACGG ACCACCCAGT CAGGTATGGA 54 0 

GGTCCCTGGA ATGAGCGTCC TCGAAGCTAG GTGGGTAAGG GTTCGGCGGT GACAAACAGA 600 

AACAAACACA GAGGCAGTTT GAATCTGAGT GTATTTTGCA GCTCTCAAGC AGGGGATTTT 660 
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ATACATAAAA 


AAAAAAAAAA 


AAAAAAAACC 


AAACATTACA 


TCTCTTAGAA ACTATATCCA 


720 


ATGAAACAAT 


CACAGATACC 


AACCAAAACC 


ATTGGGCAGA 


GTAAAGCACA AAAATCATCC 


* 780 


AAGCATTACA 


ACTCTGAAAC 


CATGTATTCA 


GTGAATCACA 


AACAGAACAG GTAACATCAT 


840 


TATTAATATA 


AATCACCAAA 


ATATAACAAT 


TCTAAAAGGA 


TGTATCCAGT GGGGGCTGTC 


900 


GTCCAAGGCT 


AGTGGCAGAT 


TTCCAGGAGC 


AGGTTAGTAA 


ATCTTAACCA 


CTGAACTAAC 


960 


TCTCCAGCCC 


CATGGTCAAT 


TATTATTTAG 


CATCTAGTGC 


CTAATTTTTT 


TTTATAAATC 


1020 


TTCACTATGT 


AATTTAAAAC 


TATTTTAATT 


CTTCCTAATT 


AAGGCTTTCT 


TTACCATATA 


1080 


CCAAAATTCA 


CCTCCAATGA 


CACACGCGTA 


GCCATATGAA 


ATTTTATTGT 


TGGGAAAATT 


1140 


TGTACC7ATC 


ATAATAGTTT 


TGTAAATGAT 


TTAAAAAGCA 


AAGTGTTAGC 


CGGGCGTGGT 


1200 


GGCACACGCC 


TTTAATCCCT 


GCACTCGGGA 


GGCAGGGGCA 


GGAGGATTTC 


TGAGTTTGAG 


1260 


GCCAGCCTGG 


TCTACAGAGT 


GAGTTCCAGG 


ACAGCCAGGG 


CTACACAGAG 


AAACCCTGTC 


1320 


TCGAACCCCC 


CACCCCCCAA 


AAAAAGCAAA 


GTGTTGGTTT 


CCTTGGGGAT 


AAAGTCATGT 


1380 


TAGTGGCCCA 


TCTCTAGGCC 


CATCTCACCC 


ATTATTCTCG 


CTTAAGATCT 


TGGCCTAGGC 


1440 


TACCAGGAAC 


ATGTAAATAA 


GAAAAGGAAT 


AAGAGAAAAC 


AAAACAGAGA 


GATTGCCATG 


1500 


AGAACTACGG 


CTCAATATTT 


TTTCTCTCCG 


GCGAAGAGTT 


CCACAACCAT 


CTCCAGGAGG 


1560 


CCTCCACGTT 


TTGAGGTCAA 


TGGCCTCAGT 


CTGTGGAACT 


TGTCACACAG 


ATCTTACTGG 


1620 


AGGTGGTGTG 


GCAGAAACCC 


ATTCCTTTTA 


GTGTCTTGGG 


CTAAAAGTAA 


AAGGCCCAGA 


1680 


GGAGGCCTTT 


GCTCATCTGA 


CCATGCTGAC 


AAGGAACACG 


GGTGCCAGGA 


CAGAGGCTGG 


1740 


ACCCCAGGAA 


CACCTTAAAC 


ACTTCTTCCC 


TTCTCCGCCC 


CCTAGAGCAG 


GCTCCCCTCA 


1800 


CCAGCCTGGG 


CAGAAATGGG 


GGAAGATGGA 


GTGAAGCCAT 


ACTGGCTACT 


CCAGAATCAA 


I860 


CAGAGGGAGC 


CGGGGGCAAT 


ACTGGAGAAG 


CTGGTCTCCC 


CCCAGGGGCA 


ATCCTGGCAC 


1920 


CTCCCAGGCA 


GAAGAGGAAA 


CTTCCACAGT 


GCATCTCACT 


TCCATGAATC 


CCCTCCTCGG 


I960 


ACTCTGAGGT 


CCTTGGTCAC 


AGCTGAGGTG 


CAAAAGGCTC 


CTGTCATATT 


GTGTCCTGCT 


2040 


CTGGTCTGCC 


TTCCACAGCT 


TGGGGGCCAC 


CTAGCCCACC 


TCTCCCTAGG 


GATGAGAGCA 


2100 


GCCACTACGG 


GTCTAGGCTG 


CCCATGTAAG 


GAGGCAAGGC 


CTGGGGACAC 


CCGAGATGCC 


2160 


TGGTTATAAT 


TAACCCAGAC 


ATGTGGCTGC 


CCCCCCCCCC 


CCAACACCTG 


CTGCCTGAGC 


2220 


CTCACCCCCA 


CCCCGGTGCC 


TGGGTCTTAG 


GCTCTGTACA 


CCATGGAGGA 


GAAGCTCGCT 


2280 


CTAAAAATAA 


CCCTGTCCCT 


GGTGGATCCA 


GGGTGAGGGG 


CAGGCTGAGG 


GCGGCCACTT 


2340 


CCCTCAGCCG 


CAGGTTTGTT 


TTCCCAAGAA 


TGGTTTTTCT 


GCTTCTGTAG 


CTTTTCCTGT 


2400 


CAATTCTGCC 


ATGGTGGAGC 


AGCCTGCACT 


GGGCTTCTGG 


GAGAAACCAA 


ACCGGGTTCT 


2460 


AACCTTTCAG 


CTACAGTTAT 


TGCCTTTCCT 


GTAGATGGGC 


GACTACAGCC 


CCACCCCCAC 


2520 


CCCCGTCTCC 


TGTATCCTTC 


CTGGGCCTGG 


GGATCCTAGG 


CTTTCACTGG 


AAATTTCCCC 


2580 


CCAGGTGCTG 


TAGGCTAGAG 


TCACGGCTCC 


CAAGAACAGT 


GCTTGCCTGG 


CATGCATGGT 


2640 


TCTGAACCTC 


CAACTGCAAA 


AAATGACACA 


TACCTTGACC 


CTTGGAAGGC 


TGAGGCAGGG 


2700 


GGATTGCCAT 


GAGTGCAAAG 


CCAGACTGGG 


TGGCATAGTT 


AGACCCTGTC 


TCAAAAAACC 


2760 
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AAAAACAATT 


AAATAACTAA 


AGTCAGGCAA 


GTAATCCTAC 


TCGGGAHAPT 


GAGGCAGAGG 


2820 


GATTGTTACA 


TGTCTGAGGC 


CAGCCTGGAC 


TACATAGGGT 




CCCTGTCTAC 


2880 


AGAGTAAGGC 


CCTATTTCAA 


AAACACAAAC 


AAAATGGTTC 


TPPPZic^ptpp 


TAATGCTCAC 


294 0 


CAGGCAATGA 


AGCCTGGTGA 


GCATTAGCAA 


TGAAGGCAAT 


G AAf; r; 21 p. p. pt 


GCTGGCTACA 


3000 


ATCAAGGCTG 


TGGGGGACTG 


AGGGCAGGCT 


GTAACAGGCT 


TGGGGGPPAP, 


GGCTTATACG 


3060 


TGCCTGGGAC 


TCCCAAAGTA 


TTACTGTTCC 


ATGTTCCCGG 


CGAAClf^f^PPA 


GCTGTCCCCC 


3120 


GCCAGCTAGA 


CTCAGCACTT 


AGTTTAGGAA 


CCAGTGAGCA 


APTPZ1PPPPT 


TGGGGCAGPP 


11 on 


CATACAAGGC 


CATGGGGCTG 


GGCAAGCTGC 


ttvUvx, 1 VJVJO 1 


Lv-btaGGTGGG 


CACGGTC5PPP 


1 "> A Pi 


GGGCAACGAG 


CTGAAAGCTC 


ATCTGPTPTP 




CCCTGGGGAC 


AGCCCCTCCT 


3300 


GGCTAGTCAC 


ACCCTGTAGG 


CTCCTCTATA 


TAACCCAGGG 


GCACAGGGGC 


TGCCCCCGGG 


3360 


TCAC 












3364 



60 
120 
174 



(2) INFORMATION FOR SEQ ID NO: 10: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGTACCACTA CGGGTCTAGG CTGCCCATGT AAGGAGGCAA GGCCTGGGGA CACCCGAGAT 
GCCTGGTTAT AATTAACCCC AACACCTGCT GCCCCCCCCC CCCCAACACC TGCTGCCTGA 
GCCTGAGCGG TTACCCCACC CCGGTGCCTG GGTCTTAGGC TCTGTACACC ATGG 
(2) INFORMATION FOR SEQ ID NO: 11: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 699 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

<xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

AGATCTTCCT AGAAATTCGC TGTCTGCGAG GGCCGGCTGT TGGGGTGAGT ACTCCCTCTC 6 0 

AAAAGCGGGC ATGACTTCTG CGCTAAGATT GTCAGTTTCC AAAAACGAGG AGGATTTGAT 120 

ATTCACCTGG CCCGCGGTGA TGCCTTTGAG GGTGGCCGCG TCCATCTGGT CAGAAAAGAC 180 

AATCTTTTTG TTGTCAAGCT TGAGGTGTGG CAGGCTTGAG ATCGATCTGG CCATACACTT 240 

GAGTGACAAT GACATCCACT TTGCCTTTCT CTCCACAGGT GTCCACTCCC AGGTCCAACC 3 00 

GCGGATCTCC CGGGACCATG CCCAAGAAGA AGAGGAAGGT GTCCAATTTA CTGACCGTAC 36 0 

ACCAAAATTT GCCTGCATTA CCGGTCGATG CAACGAGTGA TGAGGTTCGC AAGAACCTGA 420 

TGGACATGTT CAGGGATCGC CAGGCGTTTT CTGAGCATAC CTGGAAAATG CTTCTGTCCG 4 80 
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TTTGCCGGTC GTGGGCGGCA TGGTGCAAGT TGAATAACCG GAAATGGTTT CCCGCAGAAC 54 
CTGAAGATGT TCGCGATTAT CTTCTATATC TTCAGGCGCG CGGTCTGGCA GTAAAAACTA 60 
TCCAGCAACA TTTGGGCCAG CTAAACATGC TTCATCGTCG GTCCGGGCTG CCACGACCAA 66 
GTGACAGCAA TGCTGTTTCA CTGGTTATGC GGCGGATCC 

6 9 9 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GAAGATCTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATTACCGAAG AAATGGCTCG 6 0 

AGATCTTC 

66 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = " DNA " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GAAGATCTCG AGCCATTTCT TCGGTAATAA CTTCGTATAG CATACATTAT ACGAAGTTAT 6 0 

AGATCTTC 

68 

(2) INFORMATION FOR SEQ ID NO:14: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA " 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CCACATGTAT AACTTCGTAT AGCATACATT ATACGAAGTT ATACATGTGG 50 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(A) DESCRIPTION: /desc = "DNA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCACATGTAT AACTTCGTAT AATGTATGCT ATACGAAGTT ATACATGTGG 50 
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!• A recombinant plasmid. comprising in operable combination: 

a) a plasmid backbone, comprising an origin of replication, an 
antibiotic resistance gene and a eukaryotic promoter element; 

b) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5' and a 3' end and arranged in a tail 
to tail orientation on said plasmid backbone: 

c) the adenovirus packaging sequence, said packaging sequence 
having a 5 ; and a 3' end and linked to one of said ITRs: and 

d) a first gene of interest operably linked to said promoter 
element. 



2. The recombinant plasmid of Claim I wherein the total size of said 
plasmid is between 27 and 40 kilobase pairs. 

3. The recombinant plasmid of Claim I. wherein said 5" end of said 
packaging sequence is linked to said 3* end of said left ITR. 

4. The recombinant plasmid of Claim 3. wherein said first gene of 
interest is linked to said 3 ? end of said packaging sequence. 

The recombinant plasmid of Claim 4. wherein said first gene of 
interest is the dystrophin cDNA gene. 

6. The recombinant plasmid of Claim 4, further comprising a second 
gene of interest. 

7. The recombinant plasmid of Claim 6. wherein said second gene of 
interest is linked to said 3* end of said right ITR. 

8. The recombinant plasmid of Claim 7. wherein said second gene of 
interest is a reporter gene. 
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9. The recombinant plasmid of Claim 8. wherein said reporter gene is 
selected from the group consisting of the £. coli P-galactosidase gene, the human 
placental alkaline phosphatase gene, the green fluroescent protein gene and the 
choramphenicol acetyltransferase gene. 

3 10. A mammalian ceil line containing the recombinant plasmid of 

Claim I. 



11. The cell line of Claim 10, wherein said cell line is a 293-derived cell 

line. 

12. A helper adenovirus comprising i) first and a second loxP sequences, 
and ii) the adenovirus packaging sequence, said packaging sequence having a 5' 
and a 3' end. 

13. The helper adenovirus of Claim 12. wherein said first loxP sequence 
is linked to the 5' end of said packaging sequence and said second loxP sequence is 
linked to said 3' end of said packaging sequence. 

14. The helper adenovirus of Claim 12. further comprising at least one 
adenovirus gene coding region. 

15. A mammalian cell line, comprising: 

a) a recombinant plasmid. comprising, in operable combination: 

i) a plasmid backbone, comprising an origin of 
replication, an antibiotic resistance gene and a eukaryotic promoter 
element. 

ii) the left and right inverted terminal repeats (lTRs) of 
adenovirus, said ITRs each having a 5" and a 3' end and arranged in 
a tail to tail orientation on said plasmid backbone. 

iii) the adenovirus packaging sequence, said packaging 
sequence having a 5' and a 3" end and linked to one of said ITRs. 
and 



15 



20 
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iv) a first gene of interest operably linked to said 
promoter element: and 

b) a helper adenovirus comprising i) first and a second loxP 
sequences, and ii) the adenovirus packaging sequence, said packaging 
5 sequence having a 5" and a Y end. 

1 6. The cell line of Claim 1 5. wherein said helper adenovirus further 
comprises at least one adenovirus gene coding region. 

17. The cell line of Claim 15. wherein said first gene of interest is the 
dystrophin cDNA gene 

^ 1 8. A method of producing an adenovirus minichromosome. comprising: 

A) providing a mammalian cell line containing: 

a) a recombinant plasmid. comprising, in operable 
combination, i) a plasmid backbone, comprising an origin of 
replication, an antibiotic resistance gene and a eukarotic promoter 
element, ii) the left and right inverted terminal repeats (ITRs) of 
adenovirus, said ITRs each having a 5 ! and a 3* end and arranged in 
a tail to tail orientation on said plasmid backbone, iii) the adenovirus 
packaging sequence, said packaging sequence having a 5 - and a 3' 
end and linked to one of said ITRs. and iv) a first gene of interest 
operably linked to said promoter element: and 

b) a helper adenovirus comprising i) first and a second 
loxP sequences, ii) at least one adenovirus gene coding region, and 
iii) the adenovirus packaging sequence, said packaging sequence 
having a 5' and a 3' end; and 

B) growing said cell line under conditions such that said 
adenovirus gene coding region is expressed and said recombinant plasmid 
directs the production of at least one adenoviral minichromosome. 

1 9. The method of Claim 1 8, wherein said adenovirus minichromosome 
is encapidated. 
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20. The method of Claim 18. further comprising 3) recovering said 
encapsidated adenovirus minichromosome. 



21. The method of Claim 20. further comprising 4) purifiying said 
recovered encapsidated adenovirus minichromosome. 

* 22. method of Claim 21. further comprisng 5) administerinc said 

purified encapsidated adenovirus minichromosome to a host. 

23. The method of Claim 22. wherein said host is a mammal. 

24. The method of Claim 23, wherein said mammal is a human. 

25. A recombinant adenovirus comprising the adenovirus E2b region 
10 having a deletion, said adenovirus capable of self-propagation in a packaging cell 

line and said E2b region comprising the DNA polymerase gene and the adenovirus 
preterminal protein gene. 

26. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase gene. 

15 27. The recombinant adenovirus of Claim 25. wherein said deletion is 

within the adenovirus preterminal protein gene. 

28. The recombinant adenovirus of Claim 25. wherein said deletion is 
within the adenovirus DNA polymerase and preterminal protein genes. 

29. A mammalian cell line stably and constitutively expressing the 
20 adenovirus El gene products and the adenovirus DNA polymerase. 

30. The cell line of Claim 29. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 
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3 1 ■ The cell line of Claim 30. wherein said deletion within said E2b 
region comprises a deletion within the adenoviral DNA polymerase gene. ' 

32. The cell of Claim 29. wherein the genome of said cell line contains 
a nucleotide sequence encoding adenovirus DNA polymerase operably linked to a 
5 heterologous promoter. 



33. The cell line of Claim 32. wherein said cell line is selected from the 
group consisting of the B-6. B-9, C-l. C-4. C-7. C-13. and C-14 cell lines. 

34. The cell line of Claim 29 further constitutively expressing the 
adenovirus preterminal protein gene product. 

35. The cell line of Claim 34. wherein said cell line comprises a 
recombinant adenovirus comprising a deletion within the E2b region, said 
recombinant adenovirus being capable of self-propagation in said cell line. 



reeion 



36. The cell line of Claim 35. wherein said deletion within said E2b 
comprises a deletion within the adenoviral preterminal protein gene. 



1 5 31 ■ The ceil line of c 'aim 36. wherein said deletion within said E2b 

region comprises a deletion within the adenoviral DNA polymerase and preterminal 
protein genes. 

38. The cell line of Claim 34. wherein the genome of said cell line 
contains a nucleotide sequence encoding adenovirus preterminal protein operably 

20 linked to a heterologous promoter. 

39. The cell line of Claim 38. wherein said cell line is selected from the 
group consisting of the C-l. C-4. C-7. C-13. and C-14 cell lines. 
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40. A method of producing infectious recombinant adenovirus particles 
containing an adenoviral genome containing a deletion within the E2b region, 
comprising 

a) providing: 

5 i) a mammalian cell line stably and constitutiveiy 

expressing the adenovirus El gene products and the adenovirus DNA 
polymerase; 

u) a recombinant adenovirus comprising a deletion within 
the E2b region, said recombinant adenovirus being capable of self- 
0 propagation in said cell line; 

b) introducing said recombinant adenovirus into said cell line 
under conditions such that said recombinant adenovirus is propagated to 
form infectious recombinant adenovirus particles; and 

c) recovering said infectious recombinant adenovirus panicles. 

41. The method of Claim 40, further comprising d) purifying said 
recovered infectious recombinant adenovirus particles. 

42. The method of Claim 41, further comprising e) administering said 
purified recombinant adenovirus particles to a host. 



43. The method of Claim 40. wherein said mammalian cell line further 
20 constitutiveiy expresses the adenovirus preterminal protein. 

44. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequences, said E2b sequences containing a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

45. The recombinant plasmid of Claim 44. wherein said deletion 
25 comprises a deletion of nucleotides 8772 to 9385 in SEQ ID NO:4. 

46. The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pApol. 
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4 /• The recombinant plasmid of Claim 45. wherein said plasmid has the 
designation pBHGl lApol. 



48. A recombinant plasmid capable of replicating in a bacterial host 
comprising adenoviral E2b sequence, said E2b sequences containing a deletion 
within the preterminal protein gene, said deletion resulting in the inability to 
express functional preterminal protein without disruption of the V A RNA genes. 

49. The recombinant plasmid of Claim 48. wherein said deletion within 
satd preterminal protein gene comprises a deletion of nucleotides 10,705 to 11 134 
in SEQ ID NO:4. 



50. The recombinant plasmid of Claim 49. wherein said plasmid has the 

designation pApTP. 

51. The recombinant plasmid of Claim 49, wherein said plasmid has the 
designation pBHGl 1 ApTP. 

52. The recombinant plasmid of Claim 48 further comprising a deletion 
within the polymerase gene, said deletion resulting in reduced polymerase activity. 

53. The recombinant plasmid of Claim 52. wherein said deletion within 
said polymerase and said preterminal protein genes comprises a deletion of 
nucleotides 8.773 to 9586 and 1 1.067 to 12.513 in SEQ ID NO:4. 

54. The recombinant plasmid of Claim 53, wherein said plasmid has the 
20 designation pAXBApolApTPVARNA+tl3. 

55. The recombinant plasmid of Claim 53, wherein said plasmid has the 
designation pBHGl 1 ApolApTPVARNA+tl3. 
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FIGURE 5 
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FIGURE 6 



293 B-6 C-7 




WO 98/17783 



7/19 



PCT/US97/1954] 



FIGURE 7 



32.0°C 
293 B-6 0-1 



38.5°C 
293 B-6 0-7 



WO 98/17783 



8/19 



PCT/US97/19541 



FIGURE 8 
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FIGURE 10 
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FIGURE 12 
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FIGURE 13 
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FIGURE 14 
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FIGURE 15 
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